Estimation of Hilbert space valued parameters by the method of sieves.

I. Introduction

By now the method of sieves, first suggested by Grenander, has become a fairly common technique in infinite dimensional parameter estimation. There are several specific examples (see Grenander (1981), Geman (1981), Karr (1987)) where this method has yielded consistent estimators with good asymptotic properties. However, the situation is quite different when it comes to finding general conditions which imply the consistency of a sieve estimator and give an idea about its rate of convergence.

The first step in this direction was taken by Geman & Hwang (1982), who generalized Wald's (1949) proof of the consistency of the maximum likelihood estimator of a finite dimensional parameter to the case of an infinite dimensional parameter via the method of sieves. They give conditions on the likelihood function and the parameter space (which is assumed to be a Polish space) so that there exists a sieve such that the corresponding sieve estimator is consistent. This result is quite general, but it gives no information about the rate of convergence of the resultant sieve estimator and, moreover, the observations are assumed to be independent and identically distributed (i.i.d.). The other results of this type (see, e.g., Karr (1987)) assume some additional structure on the likelihood function as well as on the parameter space.

Here we generalize the ideas of Ibragimov & Hasminski (in finite dimensional parameter estimation) and obtain a large deviation inequality for a sieve estimator estimating a Hilbert space valued parameter. This inequality suggests a method of consistent estimation of Hilbert space valued parameters using the sieve estimators which correspond to the sieves consisting of finite dimensional, compact, convex sets. This method naturally provides the convergence rates of the resultant estimators, and the observations do not have to be i.i.d. The usefulness of this approach is demonstrated by applying it to two examples: in the first one, the drift function in a linear stochastic differential equation is estimated, and in the next, the intensity function of a nonstationary Poisson process is estimated. In both cases, a detailed discussion of the convergence rates of our estimators and how they compare with the other estimators proposed in the literature is given.

The article is arranged in the following manner: the rest of this section is devoted to a brief explanation of our approach. Section II consists of the basic large deviation inequality, its derivation and the consequences. Section III contains the first example of the drift function estimation and Section IV contains the Poisson intensity estimation.

Let $(X_1, X_2, \ldots, X_n)$ be an $n$-dimensional random vector (the $X_i$'s can be general random objects) with joint distribution $P_\theta^n$. Assume that, for each $n$, $P_\theta^n$ possesses a density, $f^n(\theta)$, w.r.t. a $\sigma$-finite measure $\nu^n$ and the unknown parameter $\theta$ is an element of $\Theta$, a subset of a real, separable, infinite dimensional Hilbert space $H$. The goal is to estimate $\theta$ consistently using the observations $(X_1, X_2, \ldots, X_n)$.

Let $(g_j)$, $j \ge 1$, be an arbitrary but fixed set of independent vectors from $H$ and $S_k$, $k \ge 1$, be $k$-dimensional, compact, convex subsets in the subspaces of $\Theta$ spanned by $(g_j)$, $j \le k$, such that:

i) $S_k \subset S_{k+1}$,

ii) $\bigcup_k S_k$ is dense in $\Theta$ and

iii) For every $n$ and $k$, there exists a (random) point $\hat{\theta}_k^n$ such that

$$f^n(\hat{\theta}_k^n) = \sup_{\theta \in S_k} f^n(\theta).$$

That is, $(S_k)$ is a sieve consisting of finite dimensional, compact, convex sets; $\hat{\theta}_k^n$ is called a sieve estimator (for each $n$, it is merely a restricted maximum likelihood estimator).

For $\theta \in \Theta$, let $\theta_k$ denote its projection on $S_k$. Then, since both $\hat{\theta}_k^n$ and $\theta_k$ are finite dimensional, using the techniques of Ibragimov & Hasminski (1981), one can obtain an upper bound on the probability:

$$P_\theta^n \{\, n^{\beta}\, \| \hat{\theta}_k^n - \theta_k \| > h \,\} \qquad (1.1)$$

where $\|\cdot\|$ denotes the norm in $H$, and $\beta > 0$ and $h > 0$ are constants.


Now (ii) implies that, for every $k_n \to \infty$, the deterministic difference (bias) $\| \theta_{k_n} - \theta \| \to 0$. Therefore, to estimate $\theta$ consistently, it is enough to estimate $\theta_{k_n}$ consistently for some $k_n \to \infty$ (i.e. the error in estimating $\theta_{k_n}$ should tend to zero as $n \to \infty$). Thus it is sufficient to show the existence of a subsequence $k_n$ such that, for every $h > 0$ and some $\beta > 0$, the bound for the probability in (1.1) tends to zero. In fact, this also means that the rate of convergence of the sieve estimator, $\hat{\theta}_{k_n}^n$, is at least $n^{\beta}$ if, in addition, we have $n^{\beta} \| \theta_{k_n} - \theta \| \to 0$.


In Theorem 2.1 an exponential bound is obtained for the probability in (1.1) under conditions similar to those of Ibragimov & Hasminski (1981, Theorem 5.1, Ch. 1). In Corollary 2.1 sufficient conditions are stated for the existence of a subsequence $k_n$ such that the sieve estimator is consistent. In the examples considered in Sections III & IV the conditions of this corollary are verified.


II. Let $\{ \mathcal{X}^n, \mathcal{U}^n, P_\theta^n \}$ be a family of probability spaces indexed by $n \ge 1$ and $\theta \in \Theta$. Points (observations) from $\mathcal{X}^n$ will be denoted by $X^n$. Assume that the parameter space $\Theta$ is a subset of some fixed infinite dimensional separable Hilbert space $H$. Also assume that, for every $n$, there exists a $\sigma$-finite measure $\nu^n$ such that $P_\theta^n$ is absolutely continuous w.r.t. $\nu^n$ for all $\theta \in \Theta$. Let $f^n(X^n, \theta)$ denote the corresponding density.

Definition 1 (sieve) A sequence $(S_k)$ of subsets of $\Theta$ is called a sieve if $\bigcup_k S_k$ is a dense set in $\Theta$ and for each $n$ and $k$ the maximum of $f^n(X^n, \theta)$ over $S_k$ is attained in $S_k$.


Definition 2 (sieve estimator) An estimator $\hat{\theta}_k^n(X^n)$ is called a sieve estimator corresponding to the sieve $(S_k)$ if it is a point in $S_k$ such that

$$f^n(X^n, \hat{\theta}_k^n) = \sup_{t \in S_k} f^n(X^n, t).$$

Our aim is to study the asymptotic behavior of a sieve estimator $\hat{\theta}_k^n$ corresponding to a given sieve $(S_k)$, $k \ge 1$, of the following type:

i) for each $k \ge 1$, $S_k$ is a compact, convex set in the span of $\{g_j\}$, $1 \le j \le k$, where $\{g_j\}$ is some fixed set of independent vectors from $\Theta$.

ii) $S_k \subset S_{k+1}$ and $\bigcup_k S_k$ is dense in $\Theta$.

For any $\theta \in \Theta$, let $\theta_k$ denote its projection in $S_k$, i.e.

$$\| \theta - \theta_k \| = \min_{t \in S_k} \| \theta - t \|.$$

Remark 2.1: Of course, our setup does not guarantee the existence of such a sieve; we are assuming that we already have such a sieve.

Since $S_k$ is compact and convex, $\theta_k$ always exists and is unique. Furthermore, $\| \theta - \theta_k \| \to 0$ as $k \to \infty$.
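The projection onto a sieve set can be made concrete in a simple case. The sketch below assumes, purely for illustration, that the spanning vectors are orthonormal and that $S_k$ is the Euclidean ball of radius $k^q$ in their span (the shape used for the sieve in Section III); the function names and the sample coefficients are hypothetical. Under that assumption the projection is a truncation to the first $k$ coordinates followed by a radial shrink, and the bias $\| \theta - \theta_k \|$ can be computed directly.

```python
import math

def project_onto_ball_sieve(coeffs, k, q):
    """Project a coefficient vector onto the (assumed) sieve set
    S_k = {first k coordinates only, sum of squares <= k^(2q)}."""
    head = [float(c) for c in coeffs[:k]]      # orthogonal projection on the span
    radius = float(k) ** q
    norm = math.sqrt(sum(c * c for c in head))
    if norm > radius:                          # radial shrink onto the ball
        head = [c * radius / norm for c in head]
    return head

def bias(coeffs, k, q):
    """Norm of theta minus its projection theta_k (the deterministic bias)."""
    proj = project_onto_ball_sieve(coeffs, k, q)
    padded = proj + [0.0] * (len(coeffs) - len(proj))
    return math.sqrt(sum((c - p) ** 2 for c, p in zip(coeffs, padded)))

if __name__ == "__main__":
    theta = [1.0 / j for j in range(1, 51)]    # hypothetical coefficients
    for k in (1, 5, 25, 50):
        print(k, bias(theta, k, q=1.0))
```

Because the sets are nested, the printed bias is non-increasing in $k$, which is the finite-sample counterpart of $\| \theta - \theta_k \| \to 0$.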

Let $\theta \in \Theta$ be the true parameter.

Let $\Phi_k^n$ be $k \times k$ nondegenerate normalizing matrices with real elements such that $|\Phi_k^n| \to 0$ as $n \to \infty$, where $|\Phi_k^n|$ denotes the operator norm of $\Phi_k^n$. Note that $\Phi_k^n$ can be considered as a linear operator on the span of $\{g_j\}$, $1 \le j \le k$, in an obvious way. With this understanding let us define real valued random functions $Z_k^n(u)$ with domains $U_k^n$ as follows:

$$U_k^n = (\Phi_k^n)^{-1} ( S_k - \theta_k ) \qquad (2.1)$$

and for $u \in U_k^n$,

$$Z_k^n(u) = l^n( \theta_k + \Phi_k^n u ) - l^n( \theta_k ) \qquad (2.2)$$

where, for $\theta \in \Theta$, $l(X^n, \theta) = l^n(\theta)$ is the loglikelihood function of the data at $\theta$. Then, since $\hat{\theta}_k^n$ maximizes $l^n(\theta)$ in $S_k$ and $Z_k^n(0) = 0$, it is easy to verify that

$$\{\, \| (\Phi_k^n)^{-1} ( \hat{\theta}_k^n - \theta_k ) \| > h \,\} \subset \{\, \sup_{\|u\| > h} Z_k^n(u) \ge 0 \,\}.$$

Therefore the following inequality is true:

$$P_\theta^n \{\, \| (\Phi_k^n)^{-1} ( \hat{\theta}_k^n - \theta_k ) \| > h \,\} \le P_\theta^n \{\, \sup_{\|u\| > h} Z_k^n(u) \ge 0 \,\}. \qquad (2.3)$$

Under  suitable  assumptions  on  the  functions  Znk(u)  the  probability  on  the  RHS 
above  can  be  bounded. 

Let $G$ denote the following class of functions: $g \in G$ if and only if,

i) $g : [0, \infty) \to R^1$ and there exists $x_0$ such that $g$ is positive and strictly increasing on $[x_0, \infty)$.

ii) For all $N > 0$,

$$\int_0^\infty y^N \exp( -g(y) )\, dy < \infty.$$
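As an illustration of the class $G$, functions of the form $g(y) = c\, y^{\nu} - t$ with $c, \nu > 0$ and $t \ge 0$ satisfy both conditions. The sketch below checks condition (ii) numerically for one such function; the helper names, the truncation point `upper` and the midpoint rule are assumptions made for this example only. For $c = 1$, $t = 0$ the integral has the closed form $\Gamma((N+1)/\nu)/\nu$, which gives a reference value.

```python
import math

def power_g(scale, nu, shift):
    """Return g(y) = scale * y**nu - shift, a candidate member of the class G."""
    return lambda y: scale * y ** nu - shift

def moment_integral(g, N, upper=50.0, steps=200_000):
    """Midpoint-rule approximation of the integral of y^N exp(-g(y)) on [0, upper]."""
    width = upper / steps
    total = 0.0
    for i in range(steps):
        y = (i + 0.5) * width
        total += y ** N * math.exp(-g(y))
    return total * width

if __name__ == "__main__":
    g = power_g(1.0, 2.0, 0.0)
    approx = moment_integral(g, N=2)
    exact = math.gamma(1.5) / 2.0   # Gamma((N+1)/nu)/nu with N = 2, nu = 2
    print(approx, exact)
```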


Throughout this article $C$, with or without a subscript, will denote a positive constant independent of $n$ and $k$; it need not always be the same. Also, $[x]$ will denote the largest integer smaller than $x$.


Theorem 2.1 Suppose that for each $k \in N$ the following conditions hold:

C1) There exist numbers $\alpha = \alpha(k) > k$, $m = m(k) \ge \alpha$ and positive constants $B_k^n$ and $p = p(k)$ such that for all $R > 0$,

$$\sup_{\|u\|, \|v\| \le R} E_\theta^n | Z_k^n(u) - Z_k^n(v) |^m \le B_k^n ( 1 + R^p ) \| u - v \|^{\alpha}.$$

C2) There exists $g_k^n \in G$ such that for some $\eta > 0$,

$$E_\theta^n \exp( \eta Z_k^n(u) ) \le \exp( -g_k^n( \|u\| ) )$$

for all $u \in U_k^n$.

Then, for $h \in [ x_0(n,k), \infty )$,

$$P_\theta^n \{\, \| (\Phi_k^n)^{-1} ( \hat{\theta}_k^n - \theta_k ) \| > h \,\} \le B_1 \sum_{r=0}^{\infty} ( 1 + h + r )^{B_2}\, g_{kr}^n \exp( -b_1 g_{kr}^n )$$

where

i) $x_0(n,k)$ is a point such that $g_k^n$ is positive and increasing on $[ x_0(n,k), \infty )$.

ii) $g_{kr}^n = g_k^n( h + r )$.

iii) $b_1 = b_1(k) = \dfrac{\alpha - k}{\alpha - k + mk} > 0$.

iv) $B_1 = B_1(n,k) = C\, ( B_k^n )^{1/m} D$, where

$$D = (24)^{2k+4} \left( \frac{1}{1 - 2^{-(\alpha - k)/m}} + \frac{[\text{illegible}]}{m - \alpha} \right)$$

if $m > \alpha$ and

$$D = (24)^{2k+4} \left( \frac{1}{1 - 2^{-(\alpha - k)/m}} \right)$$

if $m = \alpha$.

v) $B_2 = B_2(k) = \dfrac{p + 2\alpha}{m}$.

From this result a useful corollary can be deduced which gives sufficient conditions for the existence of a consistent sieve estimator.


Corollary 2.1 Assume the conditions of Theorem 2.1. Furthermore assume that:

i) The constants $m = m(k)$, $p = p(k)$ and $\alpha = \alpha(k)$ in condition C1 are constant multiples of $k$, the multiplying constant being independent of $k$.

ii) $B_k^n \le C^k\, n^{Ck}\, k^{Ck}$ for some $C$.

iii) $g_k^n(x) = n^{\delta} k^{-\mu} | x - a(n,k) |^{\nu} - t(n,k)$, where $\delta$, $\mu$ and $\nu$ are fixed positive numbers and $t(n,k)$ and $a(n,k)$ are nonnegative constants which may depend on $n$ and $k$.

iv) For some number $s > 0$ the following holds:

a) $\delta - \mu s - 2s > 0$,

b) $a(n, [n^s])$ and $\dfrac{t(n, [n^s])}{[n^s]}$ tend to zero as $n \to \infty$.

Then $\hat{\theta}_{[n^s]}^n$ is a consistent estimator of $\theta$ and moreover, for $n$ sufficiently large, $n \ge n_0(h)$,

$$P_\theta^n \{\, \| (\Phi_{[n^s]}^n)^{-1} ( \hat{\theta}_{[n^s]}^n - \theta_{[n^s]} ) \| > h \,\} \le C \exp( -C_1\, n^{\delta - \mu s - s}\, h^{\nu} )$$

where the constants $C$ and $C_1 > 0$ depend on $\theta$ alone.


Proofs: First a result from Ibragimov & Hasminski (1981) (Theorem 19, Appendix) is stated. This result gives conditions on a random function $\xi(t)$ defined on $R^k$, so that it will have continuous paths. Furthermore, the conditions give a very useful bound on the expectation of the modulus of continuity of $\xi(t)$.

Theorem A.1 Let $\xi(t)$ be a real valued random function defined on $F$, a closed set in $R^k$. We shall assume that the process $\xi(t)$ is measurable and separable. Also assume that the following conditions are fulfilled:

I) There exist $m \ge \alpha > k$ such that for all $x \in F$,

$$E | \xi(x) |^m \le H(x).$$

II) For $h \in R^k$ such that $(x + h) \in F$,

$$E | \xi(x + h) - \xi(x) |^m \le H(x) |h|^{\alpha}$$

where $H(x)$ is some positive continuous function defined on $F$.

Then with probability one the realizations of $\xi(t)$ are continuous functions.

Moreover, if $\omega(h; \xi, L) = \sup \{ | \xi(x) - \xi(y) | \}$ where the sup is taken over $x, y \in F$ with $|x - y| \le h$, $|x|, |y| \le L$; then

$$E\, \omega(h; \xi, L) \le D(k, \alpha, m) \left( L^k \sup_{|x| \le L} H(x) \right)^{1/m} h^{(\alpha - k)/m} \ln h^{-1}$$

where the constant $D(k, \alpha, m)$ can be bounded by

$$24^{2k+4} \left( \frac{1}{1 - 2^{-(\alpha - k)/m}} \;\; [\text{remaining factors illegible in the source}] \right)$$

if $m > \alpha$; if $m = \alpha$ then the second term in the above product [remainder of the sentence illegible in the source].

This bound on $D$ is not explicitly given in Ibragimov & Hasminski (1981) but can be obtained by carefully going over their proof; the additional factor of $\ln h^{-1}$ in the bound on the modulus of continuity also seems to be necessary when $m = \alpha$. The numerical value of this bound in terms of $k$, $m$ and $\alpha$ is very important since this bound will have to be taken into account while determining the subsequence $k_n$ which controls the growth of the sets in the sieve. The details of the derivation of this bound are quite lengthy and so they are omitted; the interested reader can find them in Selukar (1989, Appendix).

Proof of Theorem 2.1: For $r = 0, 1, 2, \ldots$ let

$$\gamma_r^n = \{\, u \in U_k^n : h + r < \|u\| \le h + r + 1 \,\}. \qquad (2.4)$$

We will show that,

$$P_\theta^n [\, \sup_{u \in \gamma_r^n} Z_k^n(u) \ge 0 \,] \le B_1 ( 1 + h + r )^{B_2}\, g_{kr}^n \exp( -b_1 g_{kr}^n ) \qquad (2.5)$$

where $b_1$, $B_1$ & $B_2$ are as in Theorem 2.1. (2.5) will prove the assertion of the theorem since, from (2.3),

$$P_\theta^n \{\, \| (\Phi_k^n)^{-1} ( \hat{\theta}_k^n - \theta_k ) \| > h \,\} \le P_\theta^n \{\, \sup_{\|u\| > h} Z_k^n(u) \ge 0 \,\}$$

$$\le \sum_{r=0}^{\infty} P_\theta^n [\, \sup_{u \in \gamma_r^n} Z_k^n(u) \ge 0 \,]$$

$$\le B_1 \sum_{r=0}^{\infty} ( 1 + h + r )^{B_2}\, g_{kr}^n \exp( -b_1 g_{kr}^n );$$

which is the assertion of the theorem. Now we proceed to prove (2.5):

We subdivide the set $\gamma_r^n = \{ u \in U_k^n : h + r < \|u\| \le h + r + 1 \}$ into $N$ regions, each with diameter at most $2(1 + h + r)\delta$, $\delta$ to be specified later. This subdivision can be done such that $N \le \delta^{-k}$. Denote the above subdivision of $\gamma_r^n$ by $\gamma_{r1}^n, \gamma_{r2}^n, \ldots, \gamma_{rN}^n$ and let $u_j$ be any fixed point in $\gamma_{rj}^n$. Then,

$$P_\theta^n [\, \sup_{u \in \gamma_r^n} Z_k^n(u) \ge 0 \,] \le \sum_{j=1}^{N} P_\theta^n \{ Z_k^n(u_j) \ge -\tfrac{1}{2} \} + P_\theta^n \Big\{ \sup_{\substack{\|u - v\| \le 2\delta^* \\ \|u\|, \|v\| \le 1 + h + r}} | Z_k^n(u) - Z_k^n(v) | \ge \tfrac{1}{2} \Big\}. \qquad (2.6)$$

(here, $\delta^* = (1 + h + r)\delta$)

The first term on the RHS above can be bounded using condition C2 and Chebyshev's inequality as follows:

$$\sum_{j=1}^{N} P_\theta^n \{ Z_k^n(u_j) \ge -\tfrac{1}{2} \} \le N e^{\eta/2} \exp( -g_{kr}^n ) \qquad (2.7)$$

($\|u_j\| \in ( h + r, 1 + h + r ]$ implies that $g_k^n( \|u_j\| ) \ge g_k^n( h + r ) = g_{kr}^n$). Also, since $Z_k^n(0) = 0$, condition C1 implies that

$$\sup_{\|u\| \le R} E_\theta^n | Z_k^n(u) |^m \le B_k^n ( 1 + R^p ) R^{\alpha}. \qquad (2.8)$$

Hence, in view of condition C1, (2.8) and Theorem A.1 we get that,

$$E_\theta^n\, \omega( 2\delta(1 + h + r);\, Z_k^n,\, (1 + h + r) ) \le D\, [ B_k^n ( 1 + (1 + h + r) )^{p + \alpha} ]^{1/m}\, ( 1 + h + r )^{k/m}\, [ 2\delta( 1 + h + r ) ]^{(\alpha - k)/m} \ln [ 2\delta( 1 + h + r ) ]^{-1}$$

$$\le D\, [ B_k^n ( 2 + h + r )^{p + 2\alpha} ]^{1/m}\, [ 2\delta ]^{(\alpha - k)/m} \ln \delta^{-1}. \qquad (2.9)$$

Using (2.9) and Chebyshev's inequality the second term in the RHS of (2.6) can be bounded as follows:

$$P_\theta^n \Big\{ \sup_{\substack{\|u - v\| \le 2\delta^* \\ \|u\|, \|v\| \le 1 + h + r}} | Z_k^n(u) - Z_k^n(v) | \ge \tfrac{1}{2} \Big\} \le 4 D\, [ B_k^n ( 2 + h + r )^{p + 2\alpha} ]^{1/m}\, \delta^{(\alpha - k)/m} \ln \delta^{-1}. \qquad (2.10)$$

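Thus from (2.6), (2.7) and (2.10) we get,

$$P_\theta^n [\, \sup_{u \in \gamma_r^n} Z_k^n(u) \ge 0 \,] \le N e^{\eta/2} \exp( -g_{kr}^n ) + 4 D\, [ B_k^n ( 2 + h + r )^{p + 2\alpha} ]^{1/m}\, \delta^{(\alpha - k)/m} \ln \delta^{-1}. \qquad (2.11)$$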


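So far we have not chosen $\delta$. We will choose $\delta$ such that $N \exp( -g_{kr}^n )$ and $\delta^{(\alpha - k)/m}$ are of the same order of magnitude. Let

$$\delta = \exp\Big( -\frac{m}{(\alpha - k) + mk}\, g_{kr}^n \Big). \quad \text{Then} \quad \delta^{(\alpha - k)/m} = \exp\Big( -\frac{\alpha - k}{(\alpha - k) + mk}\, g_{kr}^n \Big),$$

and since $N \le \delta^{-k}$,

$$N \exp( -g_{kr}^n ) \le \delta^{-k} \exp( -g_{kr}^n ) = \exp\Big( -\frac{\alpha - k}{(\alpha - k) + mk}\, g_{kr}^n \Big). \qquad (2.12)$$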
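The balancing choice of $\delta$ can be checked by direct arithmetic. The following sketch, with hypothetical numerical values for $g_{kr}^n$, $\alpha$, $k$ and $m$, computes $\delta$ as above and confirms that the covering term $\delta^{-k} \exp(-g)$ and the modulus term $\delta^{(\alpha - k)/m}$ coincide, both being equal to $\exp(-b_1 g)$ with $b_1 = (\alpha - k)/((\alpha - k) + mk)$.

```python
import math

def balance_delta(g, alpha, k, m):
    """Return delta, the covering term delta^(-k) exp(-g), the modulus term
    delta^((alpha-k)/m), and b1 for the balancing choice of delta."""
    denom = (alpha - k) + m * k
    delta = math.exp(-m * g / denom)
    covering_term = delta ** (-k) * math.exp(-g)
    modulus_term = delta ** ((alpha - k) / m)
    b1 = (alpha - k) / denom
    return delta, covering_term, modulus_term, b1

if __name__ == "__main__":
    # Hypothetical values, chosen only so that alpha > k and m >= alpha.
    delta, cover, modulus, b1 = balance_delta(g=5.0, alpha=6.0, k=3, m=8.0)
    print(delta, cover, modulus, math.exp(-b1 * 5.0))
```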


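From (2.11) and (2.12) we get,

$$P_\theta^n [\, \sup_{u \in \gamma_r^n} Z_k^n(u) \ge 0 \,] \le \exp\Big( -\frac{\alpha - k}{(\alpha - k) + mk}\, g_{kr}^n \Big) \Big( e^{\eta/2} + 4 D\, [ B_k^n ( 2 + h + r )^{p + 2\alpha} ]^{1/m}\, g_{kr}^n \Big).$$

Hence

$$P_\theta^n [\, \sup_{u \in \gamma_r^n} Z_k^n(u) \ge 0 \,] \le B_1 ( 1 + h + r )^{B_2} \exp( -b_1 g_{kr}^n )\, g_{kr}^n$$

which completes the proof.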

Proof of Corollary 2.1: Let $g_n(x) = g_{[n^s]}^n(x)$; then

$$g_n(x) = C\, n^{\delta - \mu s} | x - a(n) |^{\nu} - t(n) \qquad (2.13)$$

where $a(n) = a(n, [n^s])$ and $t(n) = t(n, [n^s])$. It is easy to check that $g_n(x)$ is positive and increasing on $[ x_0(n), \infty )$ where

$$x_0(n) = a(n) + \Big\{ C\, \frac{t(n)}{[n^s]} \Big\}^{1/\nu}.$$

In view of assumption (iv) (a) and (b), $\delta - \mu s - s > \delta - \mu s - 2s > 0$ and $a(n)$ and $t(n)$ are such that

$$a(n) \to 0 \quad \text{and} \quad \frac{t(n)}{[n^s]} \to 0.$$

Therefore $x_0(n) \to 0$ as $n \to \infty$. Hence for every $h > 0$, there exists $n_0(h)$ such that for all $n \ge n_0(h)$, $x_0(n) < h$. Then the following inequality is a consequence of Theorem 2.1.

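$$P_\theta^n \{\, \| (\Phi_{[n^s]}^n)^{-1} ( \hat{\theta}_{[n^s]}^n - \theta_{[n^s]} ) \| > h \,\} \le B_1 \sum_{r=0}^{\infty} ( 1 + h + r )^{B_2}\, g_n(h + r) \exp( -b_1 g_n(h + r) ) \qquad (2.14)$$

where, because of the assumptions (i) & (ii) of the Corollary, with $k = [n^s]$,

$$b_1 = b_1(k) = \frac{\alpha - k}{\alpha - k + mk} \ge \frac{C}{k} = \frac{C}{[n^s]},$$

$$B_2 = \frac{p + 2\alpha}{m} = C,$$

$$B_1 = C\, ( B_k^n )^{1/m} D(k, \alpha, m) \le C\, (24)^{Ck}\, k^{C}\, n^{C} = C\, (24)^{C[n^s]}\, n^{C}$$

and $| g_n(h + r) | \le n^{C} k^{C} ( h + r )^{C}$. (In all of the above expressions the constant $C$ denotes a positive number independent of $n$ but it could be different in each of them.)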

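This means that the expression in (2.14) can be written as

$$P_\theta^n \{\, \| (\Phi_{[n^s]}^n)^{-1} ( \hat{\theta}_{[n^s]}^n - \theta_{[n^s]} ) \| > h \,\} \le C\, n^{C} (24)^{C[n^s]} \sum_{r=0}^{\infty} ( 1 + h + r )^{B_2} \exp\Big( -\frac{C}{[n^s]}\, g_n(h + r) \Big)$$

$$\le \Big( C\, n^{C} (24)^{C[n^s]} \exp\Big( -\frac{C}{2[n^s]}\, g_n(h) \Big) \Big) \times \sum_{r=0}^{\infty} ( 1 + h + r )^{B_2} \exp\Big( -\frac{C}{2[n^s]}\, g_n(h + r) \Big) \qquad (2.15)$$

(since $g_n(h + r) \ge g_n(h)$).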

For large $n$ the first factor in (2.15) can be made arbitrarily small, in particular smaller than 1. To see this, note that for large enough $n$

$$a(n) < \frac{h}{2} \quad \text{and} \quad \frac{t(n)}{[n^s]} < \frac{1}{2};$$

then

$$\frac{C}{[n^s]}\, g_n(h) = \frac{C}{[n^s]} \big( C\, n^{\delta - \mu s} | h - a(n) |^{\nu} - t(n) \big) \ge C\, n^{\delta - \mu s - s}\, h^{\nu} - \frac{C}{2}.$$

Therefore

$$C\, n^{C} (24)^{C[n^s]} \exp\Big( -\frac{C}{2[n^s]}\, g_n(h) \Big) \le C\, n^{C} (24)^{C[n^s]} \exp\Big( -C\, n^{\delta - \mu s - s}\, h^{\nu} + \frac{C}{2} \Big) = C\, n^{C} (24)^{C[n^s]} \exp( -C\, n^{\delta - \mu s - s}\, h^{\nu} ).$$

However, since $h > 0$ and $\delta - \mu s - s > s$, the RHS above tends to zero.

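Thus for large $n$,

$$P_\theta^n \{\, \| (\Phi_{[n^s]}^n)^{-1} ( \hat{\theta}_{[n^s]}^n - \theta_{[n^s]} ) \| > h \,\} \le \sum_{r=0}^{\infty} ( 1 + h + r )^{B_2} \exp\Big( -\frac{C}{[n^s]}\, g_n(h + r) \Big)$$

$$\le \exp\Big( -\frac{C}{2}\, n^{\delta - \mu s - s}\, h^{\nu} \Big) \sum_{r=0}^{\infty} ( 1 + h + r )^{B_2} \exp\Big( -\frac{C}{2[n^s]}\, g_n(h + r) \Big)$$

$$\le \exp\Big( -\frac{C}{2}\, n^{\delta - \mu s - s}\, h^{\nu} \Big) \times \sum_{r=0}^{\infty} ( 1 + h + r )^{B_2} \exp\Big( -\frac{C\, n^{\delta - \mu s}}{2[n^s]}\, ( h + r )^{\nu} \Big)$$

(because for large $n$, $\dfrac{g_n(h + r)}{[n^s]} \ge C\, n^{\delta - \mu s - s} ( h + r )^{\nu}$)

$$\le \exp\Big( -\frac{C}{2}\, n^{\delta - \mu s - s}\, h^{\nu} \Big). \qquad (2.16)$$

The last step follows because, using the dominated convergence theorem, it is easy to see that the second factor in the product above tends to zero as $n \to \infty$. The assertion in (2.16) essentially proves the Corollary since $| \Phi_{[n^s]}^n | \to 0$ as $n \to \infty$ implies that

$$| \hat{\theta}_{[n^s]}^n - \theta_{[n^s]} | \to 0 \quad \text{as } n \to \infty$$

and then we only need to observe that the deterministic difference $| \theta - \theta_{[n^s]} |$ always $\to 0$ as $n \to \infty$.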

III. Drift function estimation in linear SDE:

Consider the following model:

$$dx_t = \theta_t x_t\, dt + dW_t, \quad x_0 = X_0, \quad 0 \le t \le 1; \qquad (3.1)$$

where $(W_t)$ is the standard Wiener process and the initial value $X_0$ is a zero mean Gaussian random variable with variance equal to $\sigma_0^2$. Furthermore, $X_0$ is assumed to be independent of $(W_t)$.

The goal is to estimate $(\theta_t)$, an unknown deterministic function, using a sample of $n$ independent trajectories of $(x_t)$.
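To make the sampling scheme concrete, a sample of independent trajectories of (3.1) can be generated approximately on a time grid. The sketch below uses the Euler-Maruyama discretisation, which is an assumption of this illustration and not part of the estimation method; the function names, the grid size and the drift used in the usage lines are hypothetical.

```python
import math
import random

def simulate_path(theta, sigma0, steps, rng):
    """Euler-Maruyama approximation of dx = theta(t) x dt + dW on [0, 1],
    started from a N(0, sigma0^2) initial value independent of the noise."""
    dt = 1.0 / steps
    x = rng.gauss(0.0, sigma0)
    path = [x]
    for i in range(steps):
        t = i * dt
        x = x + theta(t) * x * dt + rng.gauss(0.0, math.sqrt(dt))
        path.append(x)
    return path

def simulate_sample(theta, sigma0, n, steps, seed=0):
    """Return n independent discretised trajectories."""
    rng = random.Random(seed)
    return [simulate_path(theta, sigma0, steps, rng) for _ in range(n)]

if __name__ == "__main__":
    sample = simulate_sample(lambda t: -1.0, sigma0=1.0, n=5, steps=100)
    print(len(sample), len(sample[0]), sample[0][:3])
```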

We assume that $\theta \in L^2([0,1], dt)$, that is, the parameter space $\Theta = L^2([0,1], dt)$. At this point we make an assumption that $\sigma_0^2 > 0$. This assumption is not very essential except that, in the presence of this assumption, the consistency can be obtained in $L^2([0,1], dt)$ and the statements of the results are simpler, whereas if this assumption is violated, i.e. if $X_0 = 0$ a.s., then the consistency is obtained in $L^2([0,1], \sigma_\theta^2(t)\, dt)$ where $\sigma_\theta^2(t) = E(x_t^2)$ (see Nguyen & Pham (1982), Remark, pp 609).

The above problem was first discussed in a paper by Nguyen and Pham (1982). They proposed a sieve estimator and showed its weak mean square consistency; no regularity assumptions on $\theta$ were made. However, the rate of convergence of this estimator was not discussed. Later Kutoyants (1984) proposed a kernel estimator and proved its weak pointwise consistency under some mild differentiability assumptions on $\theta$. He also discusses the rate of convergence of this estimator. Here we propose a sieve estimator which is based on the method suggested in the last section. It turns out that our estimator is a smoothed version of the estimator proposed by Nguyen & Pham. We will see that unless some kind of regularity assumption on $\theta$ is made, the method in Section II does not yield the consistency of the proposed estimator; however, under mild regularity conditions on $\theta$ this estimator is shown to be mean square consistent as well as uniform norm consistent, the rate of convergence being dependent upon the amount of regularity assumed. We first define the necessary notation, state our results and then compare them with the results of Nguyen & Pham and Kutoyants. The proofs are given at the end of the section.

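It is easy to verify that (3.1) has a unique strong solution $(x_t)$ given by

$$x_t = x_0 \exp\Big( \int_0^t \theta_u\, du \Big) + \int_0^t \exp\Big( \int_s^t \theta_u\, du \Big)\, dW_s. \qquad (3.2)$$

Note that $(x_t)$ is a continuous Gaussian process with the mean function identically equal to zero, and if $\sigma_\theta^2(t) = E x_t^2$ then it can be shown that

$$\sigma_\theta^2(t) = \sigma_0^2 \exp\Big( 2 \int_0^t \theta_u\, du \Big) + \int_0^t \exp\Big( 2 \int_s^t \theta_u\, du \Big)\, ds.$$

Obviously $\sigma_\theta^2(t)$ is a continuous function on $[0,1]$. Let

$$M_\theta = \sup_{0 \le t \le 1} \sigma_\theta^2(t) \quad \text{and} \quad m_\theta = \inf_{0 \le t \le 1} \sigma_\theta^2(t);$$

then, since $[0,1]$ is compact and $\sigma_\theta^2(t)$ continuous, $M_\theta < \infty$. Also, $m_\theta > 0$ since $\sigma_0^2 > 0$. This means that, for $f \in \Theta$,

$$m_\theta \| f \|^2 \le \| f \|_\theta^2 \le M_\theta \| f \|^2 \qquad (3.3)$$

where $\| f \|_\theta^2 = \int_0^1 f_t^2\, \sigma_\theta^2(t)\, dt$ and $\| f \|^2 = \int_0^1 f_t^2\, dt$.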
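The variance function and the constants $m_\theta$, $M_\theta$ in (3.3) can be evaluated numerically for a given drift. The sketch below uses midpoint quadrature, which is a choice made for this illustration; the function names and the constant drift in the usage lines are hypothetical. For a constant drift $\theta_u = c \ne 0$ the formula reduces to $\sigma_0^2 e^{2ct} + (e^{2ct} - 1)/(2c)$, which serves as a reference value.

```python
import math

def variance_function(theta, sigma0_sq, t, steps=2000):
    """Approximate sigma_theta^2(t) = sigma0^2 exp(2 int_0^t theta)
    + int_0^t exp(2 int_s^t theta) ds by midpoint rules."""
    h = t / steps
    cum = [0.0]                                # cum[i] ~ int_0^{i h} theta_u du
    for i in range(steps):
        cum.append(cum[-1] + theta((i + 0.5) * h) * h)
    total = cum[-1]
    integral = 0.0
    for i in range(steps):
        mid_cum = 0.5 * (cum[i] + cum[i + 1])  # int_0^s theta at the cell midpoint
        integral += math.exp(2.0 * (total - mid_cum)) * h
    return sigma0_sq * math.exp(2.0 * total) + integral

def variance_bounds(theta, sigma0_sq, grid=50):
    """Grid approximations of m_theta (infimum) and M_theta (supremum) on [0, 1]."""
    values = [variance_function(theta, sigma0_sq, i / grid) for i in range(grid + 1)]
    return min(values), max(values)

if __name__ == "__main__":
    print(variance_function(lambda u: -1.0, 1.0, 1.0))
    print(variance_bounds(lambda u: -1.0, 1.0))
```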

For $\theta \in \Theta$, let $P_\theta$ denote the measure induced by the paths of $(x_t)$ on $(C, \mathcal{C})$ where $C$ is the space of continuous functions on $[0,1]$ endowed with the supremum norm topology and $\mathcal{C}$ is the Borel $\sigma$-field of subsets of $C$ (that is, it is generated by the open sets in $C$). Then it is well known that (see, for example, Liptser and Shiryayev (1977), vol. I, Theorem 7.19, pp 277) $P_\theta$ and $P_0$ are mutually absolutely continuous ($P_0$ denoting the measure $P_\theta$ when $\theta = 0$) and,

$$\frac{dP_\theta}{dP_0}(\omega) = \exp\Big( \int_0^1 \theta_t\, \omega_t\, d\omega_t - \frac{1}{2} \int_0^1 \theta_t^2\, \omega_t^2\, dt \Big) \qquad (3.4)$$

where $\omega = (\omega_t) \in C$.
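For a trajectory observed on a grid, the logarithm of the density (3.4) can be approximated by replacing the stochastic integral with a left-point (Ito type) sum and the ordinary integral with a Riemann sum. This discretisation, and the function names below, are assumptions of the sketch; the paper itself works with continuously observed paths. For $n$ independent trajectories the log-likelihoods add.

```python
def log_likelihood(theta, path, dt):
    """Discretised log of (3.4) for one path:
    sum theta(t_i) x_i (x_{i+1} - x_i) - 0.5 * sum theta(t_i)^2 x_i^2 dt."""
    total = 0.0
    for i in range(len(path) - 1):
        th = theta(i * dt)
        x = path[i]
        total += th * x * (path[i + 1] - x) - 0.5 * th * th * x * x * dt
    return total

def sample_log_likelihood(theta, paths, dt):
    """Log-likelihood of independent paths: the sum over trajectories."""
    return sum(log_likelihood(theta, p, dt) for p in paths)

if __name__ == "__main__":
    toy_path = [1.0, 2.0]  # a single step of length dt = 1, for hand checking
    print(log_likelihood(lambda t: 1.0, toy_path, 1.0))
```

In the toy usage the stochastic-integral part is $1 \cdot 1 \cdot (2 - 1)$ and the quadratic part is $\tfrac{1}{2} \cdot 1 \cdot 1 \cdot 1$, so the value can be verified by hand.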
Suppose $\theta = (\theta_t)$, a point in $\Theta$, is the true parameter.

Our aim is to estimate $\theta$ using a sample of $n$ independent trajectories of $(x_t)$.

Let $C^n$ be the $n$-fold product of $C$, $\mathcal{C}^n$ the $n$-fold product of $\mathcal{C}$ and $P_\theta^n$ the $n$-fold product of $P_\theta$. Also, $\omega^n$ will denote a typical point in $C^n$.

With this notation, the family of probability spaces under consideration can be described as the triplet $\{ C^n, \mathcal{C}^n, P_\theta^n, \theta \in \Theta \}$.

The sieve $S_k$ to be used is defined as follows:

Let $\{ f_j \}$ be an arbitrary but fixed complete orthonormal system (CONS) in $\Theta = L^2([0,1], dt)$ and for $k \ge 1$,

$$S_k = \Big\{ \theta \in \Theta : \theta = \sum_{j=1}^{k} \theta_j f_j ;\; \sum_{j=1}^{k} \theta_j^2 \le k^{2q} \Big\}; \qquad (3.5)$$

$q > 0$ will be determined later. Let $\hat{\theta}_k^n = \hat{\theta}_k^n(\omega^n)$ be the sieve estimator corresponding to this sieve. Now we are in a position to state the main result of this section, the large deviation inequality for $\hat{\theta}_k^n$. However, we first have to define the domain of applicability of this result.
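For $\theta = \sum_{j \le k} c_j f_j$ the log-likelihood obtained from (3.4) is a concave quadratic in the coefficients, $l^n(c) = b \cdot c - \tfrac{1}{2} c^{\top} A c$, with $b_j = \sum_i \int_0^1 f_j(t)\, x_{it}\, dx_{it}$ and $A_{jl} = \sum_i \int_0^1 f_j(t) f_l(t)\, x_{it}^2\, dt$, so the sieve estimator is the maximizer of this quadratic over the ball $\sum_j c_j^2 \le k^{2q}$. The sketch below computes such a maximizer by projected gradient ascent, a numerical technique swapped in for illustration and not taken from the paper; `A`, `b` and the function names are hypothetical inputs that would have to be computed from the data.

```python
import math

def quad_value(A, b, c):
    """Value of b.c - 0.5 c'Ac."""
    k = len(c)
    Ac = [sum(A[i][j] * c[j] for j in range(k)) for i in range(k)]
    return sum(b[i] * c[i] for i in range(k)) - 0.5 * sum(c[i] * Ac[i] for i in range(k))

def project_ball(c, radius):
    """Euclidean projection onto the ball of the given radius."""
    norm = math.sqrt(sum(x * x for x in c))
    return list(c) if norm <= radius else [x * radius / norm for x in c]

def sieve_maximizer(A, b, radius, iterations=5000):
    """Projected gradient ascent for max b.c - 0.5 c'Ac subject to |c| <= radius.
    A is assumed symmetric positive definite; step 1/trace(A) <= 1/(largest eigenvalue)."""
    k = len(b)
    step = 1.0 / max(sum(A[i][i] for i in range(k)), 1e-12)
    c = [0.0] * k
    for _ in range(iterations):
        grad = [b[i] - sum(A[i][j] * c[j] for j in range(k)) for i in range(k)]
        c = project_ball([c[i] + step * grad[i] for i in range(k)], radius)
    return c

if __name__ == "__main__":
    A = [[2.0, 0.0], [0.0, 1.0]]   # hypothetical values
    b = [2.0, 3.0]
    print(sieve_maximizer(A, b, radius=10.0))  # constraint inactive
    print(sieve_maximizer(A, b, radius=1.0))   # constraint active
```

When the radius is large the constraint is inactive and the result is the unrestricted maximizer $A^{-1} b$; when it is small the maximizer lies on the boundary of the ball.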

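For $\theta \in \Theta$, let $P_k \theta$ and $\theta_k$ be the projections of $\theta$ on the linear span of $\{ f_j, 1 \le j \le k \}$ and $S_k$ respectively, $\{ f_j \}$ being the CONS used in defining the sieve $S_k$ (clearly, for large $k$, $k \ge K_\theta$, $P_k \theta = \theta_k$). For $0 < \varepsilon < \frac{1}{2}$ let

$$\Theta_\varepsilon = \{\, \theta \in \Theta : n^{1 - \varepsilon} \| \theta - P_{n^\varepsilon} \theta \|^2 \to 0 \text{ as } n \to \infty \,\}. \qquad (3.6)$$

That is, $\theta \in \Theta_\varepsilon$ if and only if

$$n^{1 - \varepsilon} \sum_{j > n^\varepsilon} \theta_j^2 \to 0 \quad \text{as } n \to \infty \qquad (3.7)$$

where the $\theta_j$'s are the Fourier coefficients of $\theta$ w.r.t. $\{ f_j \}$.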
From the definition it is easy to see that,

$$\varepsilon_1 < \varepsilon_2 \implies \Theta_{\varepsilon_1} \subset \Theta_{\varepsilon_2}; \qquad (3.8)$$

that is, smaller $\varepsilon$ implies higher "smoothness". It is clear that the type of functions which belong to $\Theta_\varepsilon$ depends on the CONS used in defining the sieve $S_k$ and for an arbitrary CONS it is difficult to ascertain whether a given function is a member of $\Theta_\varepsilon$ or not. However, for some special choices of CONSs it is possible to identify large classes of functions which belong to $\Theta_\varepsilon$. One such choice of CONS $\{ f_j \}$ is illustrated below:

Set $f_1 = 1$ and for $j \ge 2$, let

$f_j(t) = \sqrt{2} \cos( \pi j t )$ for even $j$'s and

$f_j(t) = \sqrt{2} \sin[ \pi ( j - 1 ) t ]$ for odd $j$'s; $t \in [0,1]$. (3.9)
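The orthonormality of the trigonometric system can be verified numerically. In the sketch below the odd-indexed functions are taken as $\sqrt{2} \sin( \pi (j - 1) t )$, an indexing assumed here so that $\sin(2 \pi t)$ belongs to the system; the function names and the midpoint quadrature are likewise choices made for this illustration. On an equispaced grid the midpoint rule integrates these low-frequency trigonometric products exactly up to rounding.

```python
import math

def f(j, t):
    """Trigonometric system on [0, 1]: constant, then cosine/sine pairs."""
    if j == 1:
        return 1.0
    if j % 2 == 0:
        return math.sqrt(2.0) * math.cos(math.pi * j * t)
    return math.sqrt(2.0) * math.sin(math.pi * (j - 1) * t)

def inner(i, j, steps=2000):
    """Midpoint-rule approximation of the L2([0,1]) inner product of f_i and f_j."""
    h = 1.0 / steps
    return sum(f(i, (s + 0.5) * h) * f(j, (s + 0.5) * h) for s in range(steps)) * h

if __name__ == "__main__":
    for i in range(1, 5):
        print([round(inner(i, j), 6) for j in range(1, 5)])
```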

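For $m \ge 0$ and $0 < \alpha \le 1$ let $\Theta_{m,\alpha}$ be the class of $m$-times differentiable functions with $m$-th derivative Lipschitz continuous of order $\alpha$. More precisely,

$\Theta_{m,\alpha} = \{\, \theta : \theta$ is $m$ times continuously differentiable with $m$-th derivative Lipschitz continuous of order $\alpha$. Furthermore, for $0 \le j \le m$,

$$\theta^{(j)}(0) = \theta^{(j)}(1) \quad \text{where } \theta^{(j)}(t) = \frac{d^j}{dt^j} \theta(t) \,\} \qquad (3.10)$$

If the CONS in (3.9) is used in defining the sieve $S_k$ then it can be shown that (see Corollary 3.5),

$$\Theta_{m,\alpha} \subset \Theta_\varepsilon \quad \text{when } \varepsilon > \frac{1}{1 + 2(m + \alpha)}. \qquad (3.11)$$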
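The defining condition (3.7) can be explored numerically for a hypothetical coefficient sequence. The sketch below assumes $\theta_j^2 = j^{-(2r+1)}$ for some $r > 0$, so that the tail sum beyond $N$ behaves like $N^{-2r}/(2r)$; this decay, the names and the truncation level `J` are assumptions of the example and are not claimed for any particular function class. With this decay, $n^{1 - \varepsilon} \sum_{j > n^\varepsilon} \theta_j^2$ behaves like $n^{1 - \varepsilon - 2 r \varepsilon}$, so it decreases to zero exactly when $\varepsilon > 1/(1 + 2r)$.

```python
def tail_sum(N, r, J=200_000):
    """Approximate sum over j > N of j^-(2r+1): finite sum up to J plus
    the integral of the remainder beyond J."""
    finite = sum(j ** -(2.0 * r + 1.0) for j in range(N + 1, J + 1))
    return finite + J ** (-2.0 * r) / (2.0 * r)

def scaled_tail(n, eps, r):
    """n^(1-eps) times the tail of squared coefficients beyond n^eps."""
    N = int(n ** eps)
    return n ** (1.0 - eps) * tail_sum(N, r)

if __name__ == "__main__":
    for eps in (0.45, 0.2):   # threshold for r = 1 is 1/3
        print(eps, [scaled_tail(n, eps, r=1.0) for n in (10**2, 10**4, 10**6)])
```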


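Theorem 3.1 Assume that the true parameter $\theta$ belongs to $\Theta_\varepsilon$ for some $\varepsilon$, $0 < \varepsilon < \frac{1}{2}$. Choose $\beta > 0$ and $q > 0$ such that

$$b = 1 - 2\beta - \varepsilon - 2\varepsilon q > \varepsilon > 0.$$

Then, for all $n$ sufficiently large and $h > 0$,

$$P_\theta^n \{\, n^{\beta} \| \hat{\theta}_{n^\varepsilon}^n - \theta_{n^\varepsilon} \| > h \,\} \le \exp( -C\, n^{b}\, h^2 )$$

where $C > 0$ depends on $\theta$ alone.

Now the Borel-Cantelli lemma and the above result give us the next theorem.

Theorem 3.2 Assume that $\theta \in \Theta_\varepsilon$ for some $\varepsilon$, $0 < \varepsilon < \frac{1}{2}$. Choose $\beta > 0$ and $q > 0$ such that

$$1 - 2\beta - 2\varepsilon - 2\varepsilon q > 0.$$

Then,

$$n^{\beta} \| \hat{\theta}_{n^\varepsilon}^n - \theta \| \to 0 \quad \text{a.s. as } n \to \infty.$$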


For the following results it is assumed that the CONS $\{ f_j \}$ used in defining the sieve $S_k$ is the one given in (3.9). Then the next result is just a restatement of the last theorem using the fact given in (3.11).

Theorem 3.3 Let $\theta \in \Theta_{m,\alpha}$, $m + \alpha > \frac{1}{2}$. Then for every

$$\beta < \frac{(m + \alpha) - \frac{1}{2}}{2(m + \alpha) + 1}$$

there exist $\varepsilon > 0$ and $q > 0$ (which depend on $\beta$) such that,

$$n^{\beta} \| \hat{\theta}_{n^\varepsilon}^n - \theta \| \to 0 \quad \text{a.s. as } n \to \infty.$$


It is well known that if $\theta \in \Theta_{m,\alpha}$ and $m + \alpha > \frac{1}{2}$, the Fourier series of $\theta$ converges absolutely and uniformly. Then a few additional computations give us the following relation:

$$\sup_{0 \le t \le 1} | \hat{\theta}_{n^\varepsilon}^n(t) - \theta_{n^\varepsilon}(t) | \le n^{\varepsilon/2} \| \hat{\theta}_{n^\varepsilon}^n - \theta_{n^\varepsilon} \| \qquad (3.12)$$

(for $n \ge n_0(\theta)$).

Therefore, in order to show the uniform norm consistency of $\hat{\theta}_{n^\varepsilon}^n$ at the rate $n^{\beta}$, it suffices to obtain $L^2$-consistency at the rate $n^{\beta + \varepsilon/2}$. The next result gives sufficient conditions for this to happen. (Roughly speaking, we put $\beta + \frac{\varepsilon}{2}$ in place of $\beta$ in Theorem 3.3.)


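Theorem 3.4 Let $\theta \in \Theta_{m,\alpha}$, $m + \alpha > 1$. Then for every $\beta$,

$$\beta < \frac{(m + \alpha) - 1}{2(m + \alpha) + 1},$$

there exist $\varepsilon > 0$ and $q > 0$ (which depend on $\beta$) such that,

$$n^{\beta} \sup_{0 \le t \le 1} | \hat{\theta}_{n^\varepsilon}^n(t) - \theta(t) | \to 0 \quad \text{a.s., as } n \to \infty.$$

The next result is an obvious consequence of the above when $\theta$ is assumed to be infinitely differentiable, that is, if $\theta \in \Theta_{m,\alpha}$ for every $m$.

Corollary 3.4 Let $\theta \in \Theta_{m,\alpha}$ for every $m$; then, for every $\beta < \frac{1}{2}$ there exist $\varepsilon > 0$ and $q > 0$ such that

$$n^{\beta} \sup_{0 \le t \le 1} | \hat{\theta}_{n^\varepsilon}^n(t) - \theta(t) | \to 0 \quad \text{a.s., as } n \to \infty.$$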


Comparison of results: Let $(f_j)$, $j \ge 1$, be an arbitrary but fixed CONS and $\hat{\theta}_k^n$ be the sieve estimator corresponding to the sieve $\{ S_k, k \ge 1 \}$; $S_k$ being the $k$-dimensional subspaces spanned by $f_j$, $1 \le j \le k$. Without assuming any regularity on $\theta$, Nguyen & Pham (1982) proved that $\hat{\theta}_{k_n}^n$ converges to $\theta$ in probability in $L^2$ norm, when $k_n \to \infty$ slower than $\sqrt{n}$. The rate of convergence of $\hat{\theta}_{k_n}^n$, however, was not discussed. We were not able to obtain a result of this type using the method in Section II but, in view of Theorems 3.2 - 3.3, it is clear that in the presence of mild regularity on $\theta$ this method works quite well. Next let us consider the consistency result of Kutoyants.

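Assume that $\theta \in \Theta_{m,\alpha}$, for some $m$ and $\alpha$ such that $m + \alpha > 0$.

Suppose $(x_{1t}, \ldots, x_{nt})$ are $n$ i.i.d. copies of $(x_t)$, $0 \le t \le 1$. For $j = 1, \ldots, n$ and a truncation level $z_n$, a power of $n$ whose exponent has magnitude $\frac{m + \alpha}{2(m + \alpha) + 1}$ [sign illegible in the source], define

$$Y_{n,j}(t) = \begin{cases} ( x_{jt} )^{-1} & \text{if } | x_{jt} | > z_n \\ z_n^{-1} & \text{otherwise.} \end{cases}$$

Then the kernel estimator of Kutoyants, $\theta_n(t)$ say, is defined by

$$\theta_n(t) = \frac{1}{n a_n} \sum_{j=1}^{n} \int_0^1 K \{ \cdots \} \cdots \quad [\text{remainder of the formula illegible in the source}]$$

where $a_n$ is a power of $n$ [exponent illegible in the source] and $K(\cdot)$ is some bounded kernel. The weak pointwise consistency of $\theta_n(t)$ is proved by showing that, for every $c > 0$ and $d < 1$,

$$\limsup_{n \to \infty} \sup_{c \le t \le d} E \{\, n^{\frac{m + \alpha}{4(m + \alpha) + 2}} | \theta_n(t) - \theta_t | \,\}^2 < \infty.$$

From the above it is clear that for every $\beta$, $\beta < \frac{m + \alpha}{4(m + \alpha) + 2}$, there exists a choice of $z_n$ and $a_n$ such that,

$$n^{\beta} | \theta_n(t) - \theta_t | \to 0 \quad \text{as } n \to \infty \text{ in probability.}$$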

Let us compare this result with Theorem 3.4. Clearly, Kutoyants's result is more widely applicable than Theorem 3.4; it is applicable for $\theta \in \Theta_{m,\alpha}$, $m + \alpha > 0$, whereas for Theorem 3.4 we require that $m + \alpha > 1$. However, the possible rate of convergence of $\hat\theta^{n}_{k_n}$ exceeds the possible rate of convergence of the kernel estimator $\theta_n(t)$ when it is assumed that $\theta \in \Theta_{m,\alpha}$, $m + \alpha > 2$. It is interesting to note that even if $\theta$ is assumed to be very smooth, i.e. if $\theta \in \Theta_{m,\alpha}$ for every $m$, the largest possible (limiting) value of $\beta$ for $\theta_n(t)$ is $\frac{1}{4}$ whereas for $\hat\theta^{n}_{k_n}$ it is $\frac{1}{2}$ (Corollary 3.4). It should also be emphasized that the convergence in Theorem 3.4 is in the uniform norm.
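The comparison above can be checked with a few lines of exact arithmetic. The sketch below assumes the two limiting exponents are read as $\frac{(m+\alpha)-1}{2(m+\alpha)+1}$ for the sieve estimator (Theorem 3.4) and $\frac{m+\alpha}{4(m+\alpha)+2}$ for the kernel estimator; the function names are illustrative only.

```python
from fractions import Fraction


def sieve_rate_bound(s):
    """Supremum of beta for the sieve estimator, with s = m + alpha > 1."""
    s = Fraction(s)
    return (s - 1) / (2 * s + 1)


def kernel_rate_bound(s):
    """Supremum of beta for the kernel estimator, with s = m + alpha > 0."""
    s = Fraction(s)
    return s / (4 * s + 2)


# Tabulate both exponents for a few smoothness levels s = m + alpha.
for s in (Fraction(3, 2), 2, 3, 10, 1000):
    print(s, sieve_rate_bound(s), kernel_rate_bound(s))
```

Under this reading the two exponents coincide at $m + \alpha = 2$, the sieve exponent is larger beyond that point, and the two approach $\frac{1}{2}$ and $\frac{1}{4}$ respectively as the smoothness grows.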

Proofs: First we prove Theorem 3.1. In view of (3.3) the statement of the theorem is equivalent to the statement with the $\|\cdot\|$ norm replaced by the $\|\cdot\|_{\theta}$ norm. This equivalent statement is proved by verifying the conditions of Corollary 2.1. The verification is done with the help of several lemmas. The first two lemmas are technical. Lemma 3.1 bounds the higher moments of the $L^2$ norm of a square integrable Gaussian process in terms of its second moment and Lemma 3.2 gives bounds on its moment generating function. Both these results are important in their own right and their proofs must be available in the literature, but since the author was unable to find the original sources, brief proofs are sketched.

Lemma 3.1 Let $(Y_t)$, $0 \le t \le T$, be a zero mean Gaussian process such that

$$ E \int_0^T Y_t^2 \, dt < \infty . \tag{3.13} $$

Then, for all $k \ge 1$,

$$ E \Big( \int_0^T Y_t^2 \, dt \Big)^{k} \le k^k \Big[ E \Big( \int_0^T Y_t^2 \, dt \Big) \Big]^{k} . $$

Proof Let $R(t,s) = E \, Y_t Y_s$ and let $R$ be the corresponding covariance operator defined on $L^2[0,T]$. The condition (3.13) implies that $(Y_t) \in L^2[0,T]$ a.s. and that $R$ is a self adjoint, non-negative definite, trace class operator. In fact,

$$ tr(R) := \mathrm{Trace}(R) = \int_0^T R(t,t) \, dt = E \int_0^T Y_t^2 \, dt . \tag{3.14} $$

Since $(Y_t) \in L^2[0,T]$ a.s., we can expand it in terms of any fixed complete orthonormal system (CONS) in $L^2[0,T]$. If the CONS chosen is the one consisting of the eigenfunctions of the covariance operator $R$, the resulting expansion is known as the Karhunen-Loeve expansion. Let $\{f_j\}$ be the system of eigenfunctions of $R$ and let the $\lambda_j$'s be the corresponding eigenvalues; then the Karhunen-Loeve expansion of $(Y_t)$ can be written as

$$ Y_t = \sum_{j=1}^{\infty} \sqrt{\lambda_j} \, X_j \, f_j(t) \quad \text{a.s.} \tag{3.15} $$

where the coefficients $(X_j)$ are i.i.d. $N(0,1)$ random variables.

From the above the following identities are immediate:

$$ \int_0^T Y_t^2 \, dt = \sum_{j=1}^{\infty} \lambda_j X_j^2 \quad \text{and} \tag{3.16} $$

$$ E \int_0^T Y_t^2 \, dt = \sum_{j=1}^{\infty} \lambda_j = tr(R) . \tag{3.17} $$

Therefore,

$$ E \Big( \int_0^T Y_t^2 \, dt \Big)^{k} = E \Big( \sum_{j=1}^{\infty} \lambda_j X_j^2 \Big)^{k} = E \Big\{ \Big( \sum_{1}^{\infty} \lambda_j \Big)^{k} \Big[ \Big( \sum_{1}^{\infty} \lambda_j \Big)^{-1} \sum_{j=1}^{\infty} \lambda_j X_j^2 \Big]^{k} \Big\} . $$

Applying Jensen's inequality for the quantity in the square bracket we get

$$ E \Big( \int_0^T Y_t^2 \, dt \Big)^{k} \le \Big( \sum_{1}^{\infty} \lambda_j \Big)^{k-1} E \Big( \sum_{j=1}^{\infty} \lambda_j X_j^{2k} \Big) = \Big( \sum_{1}^{\infty} \lambda_j \Big)^{k} E X_1^{2k} $$

since the $X_j$'s are i.i.d. Moreover, because $X_1$ is $N(0,1)$, using Stirling's approximation (see Rao (1973), pp 59),

$$ E X_1^{2k} = \frac{(2k)!}{2^k \, k!} \le k^k . $$

So finally we have

$$ E \Big( \int_0^T Y_t^2 \, dt \Big)^{k} \le k^k \Big( \sum_{1}^{\infty} \lambda_j \Big)^{k} ; $$

which completes the proof (see (3.17)).
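As a quick sanity check of the two ingredients of this proof, the following sketch verifies the Gaussian moment bound $(2k)!/(2^k k!) \le k^k$ with exact integer arithmetic and evaluates the case $k = 2$ of the lemma for a finite set of eigenvalues. The function names and the sample eigenvalues are assumptions made for illustration.

```python
from math import factorial


def gaussian_even_moment(k):
    """E X^(2k) for X ~ N(0,1), i.e. (2k)! / (2^k k!), computed exactly."""
    return factorial(2 * k) // (2 ** k * factorial(k))


def second_moment_of_quadratic_form(lams):
    """E (sum_j lam_j X_j^2)^2 for i.i.d. N(0,1) X_j.

    Uses Var(X^2) = 2, so the value is (sum lam)^2 + 2 * sum lam^2.
    """
    total = sum(lams)
    return total * total + 2.0 * sum(lam * lam for lam in lams)


# The moment bound used at the end of the proof, for k = 1, ..., 10.
for k in range(1, 11):
    assert gaussian_even_moment(k) <= k ** k

# Lemma 3.1 with k = 2 for a hypothetical finite spectrum.
lams = [0.5, 0.3, 0.2]
assert second_moment_of_quadratic_form(lams) <= 2 ** 2 * sum(lams) ** 2
```

The first check is exact; the second only illustrates the inequality for one choice of eigenvalues and is not a proof.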


Lemma 3.2 Assume that the Gaussian process $(Y_t)$ is as in Lemma 3.1 and $R$ is the corresponding covariance operator; also, let $\|R\|$ denote the operator norm of $R$. Then,

i) For $0 < a < ( 2 \|R\| )^{-1}$,

$$ E \exp \Big( a \int_0^T Y_t^2 \, dt \Big) \le \exp \Big( \frac{a \, tr(R)}{1 - 2a \|R\|} \Big) . \tag{3.18} $$

If $0 < a < ( 2 \, tr(R) )^{-1}$ the following weaker version of the above inequality can be useful.

$$ E \exp \Big( a \int_0^T Y_t^2 \, dt \Big) \le \exp \Big( \frac{a \, tr(R)}{1 - 2a \, tr(R)} \Big) . \tag{3.19} $$

ii)

$$ E \exp \Big( - \int_0^T Y_t^2 \, dt \Big) \le \exp \Big( \frac{- tr(R)}{1 + 2 \|R\|} \Big) \le \exp \Big( \frac{- tr(R)}{1 + 2 \, tr(R)} \Big) . \tag{3.20} $$

Note that $tr(R) = \int_0^T E Y_t^2 \, dt$.


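Proof From (3.15),

$$ \int_0^T Y_t^2 \, dt = \sum_{j=1}^{\infty} \lambda_j X_j^2 $$

where the $\lambda_j$'s are the eigenvalues of $R$ and the $X_j$'s are i.i.d. $N(0,1)$. Note that if $X$ is a $N(0,1)$ variable then, for $\beta < \frac{1}{2}$, $E \exp(\beta X^2) = (1 - 2\beta)^{-1/2}$. Therefore, using the fact that $\|R\| \ge \lambda_j$ for all $j$ and $a < ( 2 \|R\| )^{-1}$ we get

$$ E \exp ( a \lambda_j X_j^2 ) = (1 - 2a \lambda_j)^{-1/2} . $$

Hence

$$ E \exp \Big( a \int_0^T Y_t^2 \, dt \Big) = \prod_{j=1}^{\infty} (1 - 2a \lambda_j)^{-1/2} = \exp \Big( - \frac{1}{2} \sum_{1}^{\infty} \ln (1 - 2a \lambda_j) \Big) = \exp \Big( \frac{1}{2} \sum_{1}^{\infty} \ln \Big( 1 + \frac{2a \lambda_j}{1 - 2a \lambda_j} \Big) \Big) . $$

Since $\ln (1 + x) \le x$ for $x \ge 0$, it follows that

$$ E \exp \Big( a \int_0^T Y_t^2 \, dt \Big) \le \exp \Big( a \sum_{1}^{\infty} \frac{\lambda_j}{1 - 2a \lambda_j} \Big) . $$

The conclusions in (i) follow because of the following observation

$$ \frac{1}{1 - 2a \lambda_j} \le \frac{1}{1 - 2a \|R\|} \le \frac{1}{1 - 2a \, tr(R)} . $$

Proof of (ii) is also almost identical.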
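The bounds of Lemma 3.2 can be checked numerically whenever the eigenvalues are known. The sketch below uses, as an assumed example not taken from the text, the first 2000 Karhunen-Loeve eigenvalues $\lambda_j = ((j - \frac{1}{2}) \pi)^{-2}$ of standard Brownian motion on $[0,1]$, for which $tr(R) = \frac{1}{2}$; the truncated spectrum is treated as a finite-rank process in its own right.

```python
import math


def brownian_eigenvalues(n_terms):
    """First n_terms KL eigenvalues of standard Brownian motion on [0, 1]."""
    return [1.0 / (((j - 0.5) * math.pi) ** 2) for j in range(1, n_terms + 1)]


def log_mgf(a, lams):
    """log E exp(a * sum_j lam_j X_j^2) = -1/2 * sum_j log(1 - 2 a lam_j).

    Requires 2 * a * max(lams) < 1; negative a is allowed.
    """
    return -0.5 * sum(math.log1p(-2.0 * a * lam) for lam in lams)


def log_bound_318(a, lams):
    """Logarithm of the right-hand side of (3.18)."""
    trace, op_norm = sum(lams), max(lams)
    return a * trace / (1.0 - 2.0 * a * op_norm)


def log_bound_320(lams):
    """Logarithm of the first bound in (3.20)."""
    trace, op_norm = sum(lams), max(lams)
    return -trace / (1.0 + 2.0 * op_norm)


lams = brownian_eigenvalues(2000)
for a in (0.1, 0.5, 1.0, 1.2):  # all below 1 / (2 * max(lams)), about 1.23
    assert log_mgf(a, lams) <= log_bound_318(a, lams)
assert log_mgf(-1.0, lams) <= log_bound_320(lams)
```

Because the moment generating function is available in closed form here, the comparison is deterministic and needs no simulation.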


Now, following the notation in Section II, let us define random functions $Z_{kn}(u)$ with domains $U_{kn}$ as follows:

For $n \ge 1$, $k \ge 1$ and $\beta > 0$ let

$$ U_{kn} = n^{\beta} ( S_k - \theta_k ) \tag{3.21} $$

and

$$ Z_{kn}(u) = l_n(\theta_k + n^{-\beta} u) - l_n(\theta_k) \tag{3.22} $$

where $\theta_k$ is the projection of $\theta$ in $S_k$ and $l_n(\theta)$ is the loglikelihood function at $\theta$ when the sample size is $n$.

From (3.4) and the fact that $P^n_{\theta}$ is a product measure the following expression for the loglikelihood function is obtained:

$$ l_n(\theta, \omega^n) = \ln \frac{dP^n_{\theta}}{dP^n_{0}} (\omega^n) = \sum_{i=1}^{n} \Big[ \int_0^1 \theta_t \, \omega^i_t \, d\omega^i_t - \frac{1}{2} \int_0^1 \theta_t^2 \, (\omega^i_t)^2 \, dt \Big] . \tag{3.23} $$

The next lemma verifies the first condition of Corollary 2.1.


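Lemma 3.3 For $u, v \in U_{kn}$,

$$ E \, | Z_{kn}(u) - Z_{kn}(v) |^{2k} \le B_{kn} \, \| u - v \|_{\theta}^{2k} $$

where

$$ B_{kn} = n^{2k(1-\beta)+1} \, k^{3k} \, \| \theta \|_{\theta}^{2k} \, ( 16 M_{\theta} )^{2k} . $$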

Proof From (3.23),

$$ Z_{kn}(u) - Z_{kn}(v) = l_n(\theta_k + n^{-\beta} u) - l_n(\theta_k + n^{-\beta} v) $$

$$ = \sum_{i=1}^{n} \Big\{ n^{-\beta} \int_0^1 (u_t - v_t) \, \omega^i_t \, d\omega^i_t - \frac{1}{2} \int_0^1 \big[ (\theta_k + n^{-\beta} u)_t^2 - (\theta_k + n^{-\beta} v)_t^2 \big] (\omega^i_t)^2 \, dt \Big\} . $$

Hence, using the elementary inequality $( \sum_{1}^{m} a_i )^k \le m^k ( \sum_{1}^{m} a_i^k )$ for positive numbers $a_i$ repeatedly, and the fact that $(\omega^i_t)$ are i.i.d., we can bound $E \, | Z_{kn}(u) - Z_{kn}(v) |^{2k}$ by the following expression:

$$ n \, 2^{2k} \, n^{2k(1-\beta)} \, E \Big( \int_0^1 (u_t - v_t) \, \omega_t \, d\omega_t \Big)^{2k} + n \, n^{2k} \, E \Big( \int_0^1 \big[ (\theta_k + n^{-\beta} u)_t^2 - (\theta_k + n^{-\beta} v)_t^2 \big] (\omega_t)^2 \, dt \Big)^{2k} . \tag{3.24} $$

We will bound each term in (3.24) separately. First consider

$$ E \Big( \int_0^1 (u_t - v_t) \, \omega_t \, d\omega_t \Big)^{2k} : $$

Note that, under $P_{\theta}$, $(\omega_t)$ satisfies the following SDE:

$$ d\omega_t = \theta_t \, \omega_t \, dt + dW_t(\omega) , \quad \omega_0 = X_0 $$

where $W_t(\omega)$ is the standard Wiener process under $P_{\theta}$, and $X_0$ is a $N(0, \sigma_0^2)$ r.v. independent of $W_t(\omega)$. Therefore

$$ \int_0^1 (u_t - v_t) \, \omega_t \, d\omega_t = \int_0^1 (u_t - v_t) \, \theta_t \, \omega_t^2 \, dt + \int_0^1 (u_t - v_t) \, \omega_t \, dW_t(\omega) . $$

Hence $E ( \int_0^1 (u_t - v_t) \, \omega_t \, d\omega_t )^{2k}$ is bounded by

$$ 2^{2k} \Big\{ E \Big( \int_0^1 (u_t - v_t) \, \theta_t \, \omega_t^2 \, dt \Big)^{2k} + E \Big( \int_0^1 (u_t - v_t) \, \omega_t \, dW_t(\omega) \Big)^{2k} \Big\} . \tag{3.25} $$

The first term in the bracket above can be bounded by the repeated application of the Cauchy-Schwarz inequality and, finally, of Lemma 3.1 by

$$ \{ k^{2k} \, \| u - v \|_{\theta}^{4k} \, \| \theta \|_{\theta}^{4k} \}^{1/2} . $$

Thus,

$$ E \Big( \int_0^1 (u_t - v_t) \, \theta_t \, \omega_t^2 \, dt \Big)^{2k} \le k^k \, \| u - v \|_{\theta}^{2k} \, \| \theta \|_{\theta}^{2k} . \tag{3.26} $$

Next consider $E ( \int_0^1 (u_t - v_t) \, \omega_t \, dW_t(\omega) )^{2k}$ (the second term in (3.25)); it can be bounded by

$$ (8k)^{2k} \, E \Big( \int_0^1 (u_t - v_t)^2 \, \omega_t^2 \, dt \Big)^{k} . $$

The above bound follows from the Burkholder - Davis - Gundy (BDG) inequality stated here for the reader's convenience (see Dellacherie and Meyer (1982), pp 287 where a general inequality is given).

BDG inequality Let $(M_t)$, $0 \le t \le T$, be a square integrable martingale with squared variation process $\langle M \rangle$; then,

$$ E \sup_{0 \le t \le T} | M_t |^{2p} \le (8p)^{2p} \, E \langle M \rangle_T^{p} $$

for all $p \ge 1$. Therefore, using Lemma 3.1, we get

$$ E \Big( \int_0^1 (u_t - v_t) \, \omega_t \, dW_t(\omega) \Big)^{2k} \le 8^{2k} \, k^{3k} \, \| u - v \|_{\theta}^{2k} . \tag{3.27} $$

Hence, in view of (3.25), (3.26) and (3.27), the first term in (3.24) can be bounded by

$$ \| u - v \|_{\theta}^{2k} \; n^{2k(1-\beta)+1} \, k^{3k} \, 8^{4k} \, \| \theta \|_{\theta}^{2k} . $$

The second term in (3.24) can be bounded in the same fashion and then combining these two bounds the lemma follows.

The second condition of Corollary 2.1 is verified in Lemma 3.6; the following two lemmas are preparatory.

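Lemma 3.4

$$ E \exp \Big( \frac{n^{-\beta}}{4} \int_0^1 u_t \, \omega_t \, d\omega_t - \frac{1}{8} \int_0^1 \big[ (\theta_k + n^{-\beta} u)_t^2 - (\theta_k)_t^2 \big] \omega_t^2 \, dt \Big) $$

$$ \le \Big( E \exp \Big( - \frac{1}{4} \int_0^1 \big[ n^{-\beta} u - 2 (\theta - \theta_k) \big]_t^2 \, \omega_t^2 \, dt \Big) \Big)^{1/4} \cdot \Big( E \exp \Big( \int_0^1 (\theta - \theta_k)_t^2 \, \omega_t^2 \, dt \Big) \Big)^{1/4} . $$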

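Proof Recall that, under $P_{\theta}$, the paths $(\omega_t)$ satisfy the following SDE:

$$ d\omega_t = \theta_t \, \omega_t \, dt + dW_t(\omega) , \quad \omega_0 = X_0 . $$

Hence, writing $\theta_t = (\theta - \theta_k)_t + (\theta_k)_t$ we get

$$ \frac{n^{-\beta}}{4} \int_0^1 u_t \, \omega_t \, d\omega_t - \frac{1}{8} \int_0^1 \big[ (\theta_k + n^{-\beta} u)_t^2 - (\theta_k)_t^2 \big] \omega_t^2 \, dt $$

$$ = \frac{n^{-\beta}}{4} \int_0^1 u_t \, \omega_t \, dW_t(\omega) + \frac{n^{-\beta}}{4} \int_0^1 u_t \, (\theta - \theta_k)_t \, \omega_t^2 \, dt - \frac{n^{-2\beta}}{8} \int_0^1 u_t^2 \, \omega_t^2 \, dt . $$

Therefore, applying the Cauchy-Schwarz inequality,

$$ E \exp \Big( \frac{n^{-\beta}}{4} \int_0^1 u_t \, \omega_t \, d\omega_t - \frac{1}{8} \int_0^1 \big[ (\theta_k + n^{-\beta} u)_t^2 - (\theta_k)_t^2 \big] \omega_t^2 \, dt \Big) $$

$$ \le \Big( E \exp \Big( \frac{n^{-\beta}}{2} \int_0^1 u_t \, \omega_t \, dW_t(\omega) - \frac{n^{-2\beta}}{8} \int_0^1 u_t^2 \, \omega_t^2 \, dt \Big) \Big)^{1/2} $$

$$ \times \Big( E \exp \Big( - \frac{n^{-2\beta}}{8} \int_0^1 u_t^2 \, \omega_t^2 \, dt + \frac{n^{-\beta}}{2} \int_0^1 u_t \, (\theta - \theta_k)_t \, \omega_t^2 \, dt \Big) \Big)^{1/2} $$

$$ = 1 \cdot \Big( E \exp \Big( - \frac{1}{8} \int_0^1 \big[ n^{-\beta} u_t - 2 (\theta - \theta_k)_t \big]^2 \omega_t^2 \, dt + \frac{1}{2} \int_0^1 (\theta - \theta_k)_t^2 \, \omega_t^2 \, dt \Big) \Big)^{1/2} , $$

since the expression inside the first bracket is in fact a Radon-Nikodym derivative with respect to $P_{\theta}$, i.e. it is a density.

The conclusion of the lemma follows if we apply the Cauchy-Schwarz inequality again.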


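Lemma 3.5 For $u \in U_{kn}$ and $k$ large so that $1 - 2 \| \theta - \theta_k \|_{\theta}^2 > 0$,

$$ E \exp \Big( \frac{1}{4} Z_{kn}(u) \Big) \le \exp \Big( - \frac{n}{16} \, \frac{ \| n^{-\beta} u - 2 (\theta - \theta_k) \|_{\theta}^2 }{ 1 + \| n^{-\beta} u - 2 (\theta - \theta_k) \|_{\theta}^2 } \Big) \times \exp \Big( \frac{ n \, \| \theta - \theta_k \|_{\theta}^2 }{ 4 ( 1 - 2 \| \theta - \theta_k \|_{\theta}^2 ) } \Big) . $$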

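Proof Note that $Z_{kn}$ can be written as

$$ Z_{kn}(u) = \sum_{i=1}^{n} \Big\{ n^{-\beta} \int_0^1 u_t \, \omega^i_t \, d\omega^i_t - \frac{1}{2} \int_0^1 \big( [\theta_k + n^{-\beta} u]_t^2 - [\theta_k]_t^2 \big) (\omega^i_t)^2 \, dt \Big\} $$

where $(\omega^i_t)$, $0 \le t \le 1$, $i = 1, 2, \ldots, n$ are i.i.d. processes. From this independence and Lemma 3.4 we get,

$$ E \exp \Big( \frac{1}{4} Z_{kn}(u) \Big) \le \Big( E \exp \Big( - \frac{1}{4} \int_0^1 \big[ n^{-\beta} u_t - 2 (\theta - \theta_k)_t \big]^2 \omega_t^2 \, dt \Big) \Big)^{n/4} \Big( E \exp \Big( \int_0^1 (\theta - \theta_k)_t^2 \, \omega_t^2 \, dt \Big) \Big)^{n/4} . $$

To obtain the lemma we now apply Lemma 3.2 (ii) to the first term and (i) to the last term of the product in the RHS above.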

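Let us bound the expression in the last lemma by a slightly simpler expression.

First note that since $| \, \|x\|_{\theta} - \|y\|_{\theta} \, | \le \|x - y\|_{\theta}$,

$$ - \frac{n}{16} \, \frac{ \| n^{-\beta} u - 2 (\theta - \theta_k) \|_{\theta}^2 }{ 1 + \| n^{-\beta} u - 2 (\theta - \theta_k) \|_{\theta}^2 } \le - \frac{n}{16} \, \frac{ | \, n^{-\beta} \|u\|_{\theta} - 2 \| \theta - \theta_k \|_{\theta} \, |^2 }{ 1 + \| n^{-\beta} u - 2 (\theta - \theta_k) \|_{\theta}^2 } . $$

Furthermore since $u \in U_{kn} = n^{\beta} ( S_k - \theta_k )$,

$$ \|u\|_{\theta} \le 2 \sqrt{M_{\theta}} \, n^{\beta} k^{q} $$

and, for large $k$, $k \ge k_0$,

$$ 1 + \| n^{-\beta} u - 2 (\theta - \theta_k) \|_{\theta}^2 \le C'_{\theta} \, k^{2q} $$

(since $\| \theta - \theta_k \|_{\theta}^2 \to 0$ as $k \to \infty$). Therefore, with $C_{\theta} = (16 \, C'_{\theta})^{-1}$,

$$ - \frac{n}{16} \, \frac{ \| n^{-\beta} u - 2 (\theta - \theta_k) \|_{\theta}^2 }{ 1 + \| n^{-\beta} u - 2 (\theta - \theta_k) \|_{\theta}^2 } \le - \frac{n \, C_{\theta}}{k^{2q}} \, | \, n^{-\beta} \|u\|_{\theta} - 2 \| \theta - \theta_k \|_{\theta} \, |^2 . \tag{3.28} $$

Also for large $k$, $1 - 2 \| \theta - \theta_k \|_{\theta}^2 \ge \frac{1}{2}$; hence

$$ \frac{ n \, \| \theta - \theta_k \|_{\theta}^2 }{ 1 - 2 \| \theta - \theta_k \|_{\theta}^2 } \le 2n \, \| \theta - \theta_k \|_{\theta}^2 . \tag{3.29} $$

Therefore, from (3.28), (3.29) and Lemma 3.5 we get,

$$ E \exp \Big( \frac{1}{4} Z_{kn}(u) \Big) \le \exp ( - g_{kn}( \|u\|_{\theta} ) ) \tag{3.30} $$

with $g_{kn}$ as given in the next lemma, which states the conclusion in (3.30) and verifies the second condition of Corollary 2.1.

Lemma 3.6 For $u \in U_{kn}$ and $k \ge k_0$,

$$ E \exp \Big( \frac{1}{4} Z_{kn}(u) \Big) \le \exp ( - g_{kn}( \|u\|_{\theta} ) ) $$

where

$$ g_{kn}(x) = \frac{n \, C_{\theta}}{k^{2q}} \, | \, n^{-\beta} x - 2 \| \theta - \theta_k \|_{\theta} \, |^2 - 2n \, \| \theta - \theta_k \|_{\theta}^2 . $$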


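It is easy to check the remaining conditions of Corollary 2.1. Note that, here,

1) $m = a = 2k$, $p = 0$,

2) $B_{kn} \le n^{2k(1-\beta)+1} \, k^{k(3+2q)} \, C^{2k}$ and

3) $g_{kn}(x) = C_{\theta} \, n^{\delta} k^{-\mu} \, | x - a(n,k) |^{\nu} - t(n,k)$,

where

$\delta = 1 - 2\beta$, $\mu = 2q$, $\nu = 2$,
$a(n,k) = 2 n^{\beta} \| \theta - \theta_k \|_{\theta}$ and
$t(n,k) = 2n \, \| \theta - \theta_k \|_{\theta}^2$.

Also recall that, by assumption, $\theta \in \Theta_{\varepsilon}$ and $1 - 2\beta - 2\varepsilon - 2\varepsilon q > 0$. Therefore if we take $s = \varepsilon > 0$ then

$$ \delta - \mu \varepsilon - 2s = 1 - 2\beta - 2\varepsilon - 2\varepsilon q > 0 . $$

Now let us see if we can show

i) $a(n, [n^{\varepsilon}]) = 2 n^{\beta} \| \theta - \theta_{n^{\varepsilon}} \|_{\theta} \to 0$ and

ii) $\dfrac{t(n, [n^{\varepsilon}])}{[n^{\varepsilon}]} \le 4 n^{1-\varepsilon} \| \theta - \theta_{n^{\varepsilon}} \|_{\theta}^2 \to 0$ as $n \to \infty$.

(ii) is easy to see because $\theta \in \Theta_{\varepsilon}$ implies that

$$ n^{1-\varepsilon} \| \theta - P_{n^{\varepsilon}} \theta \|^2 \to 0 $$

which implies that

$$ n^{1-\varepsilon} \| \theta - \theta_{n^{\varepsilon}} \|_{\theta}^2 \to 0 ; $$

which is (ii). It also shows (i) since $1 - \varepsilon > 2\beta$. This concludes the proof of Theorem 3.1.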

Now we show that Theorems 3.2 - 3.5 follow from Theorem 3.1.

Proof of Theorem 3.2 Obvious from Theorem 3.1, the Borel-Cantelli lemma and the following:

$$ n^{\beta} \| \hat\theta^{n}_{n^{\varepsilon}} - \theta \| \le n^{\beta} \| \hat\theta^{n}_{n^{\varepsilon}} - \theta_{n^{\varepsilon}} \| + n^{\beta} \| \theta - \theta_{n^{\varepsilon}} \| . $$


Before starting the proof of Theorem 3.3 let us review some standard facts from Fourier theory. These are stated as propositions. From now on, $\{f_j\}$ will always denote the CONS defined in (3.9), i.e. the one with the trigonometric functions. Also recall that, for $m \ge 0$ and $0 < \alpha \le 1$, $\Theta_{m,\alpha}$ is the space of $m$-times continuously differentiable functions with $m$-th derivative Lipschitz continuous of order $\alpha$.

Proposition 3.1 Let $\theta \in \Theta_{m,\alpha}$. Then there exists $M = M(m, \alpha, \theta)$ such that

$$ \| \theta - P_n \theta \|^2 = \int_0^1 [ \theta_t - ( P_n \theta )_t ]^2 \, dt \le M \, n^{-2(m+\alpha)} . $$

The above inequality is a simple consequence of a result called Jackson's inequality [see Zygmund (1949), Theorem 13.6, pp 115, vol I] and the fact that $P_n \theta$ is the projection of $\theta$ on the linear span of $\{ f_j \}$, $1 \le j \le n$.

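Corollary 3.5

$$ \Theta_{m,\alpha} \subset \Theta_{\varepsilon} \quad \text{for all } \varepsilon > \frac{1}{1 + 2(m+\alpha)} . $$

Proof We have to show that $\theta \in \Theta_{m,\alpha}$, $\varepsilon > \frac{1}{1 + 2(m+\alpha)}$, implies that

$$ n^{1-\varepsilon} \| \theta - P_{n^{\varepsilon}} \theta \|^2 \to 0 \quad \text{as } n \to \infty . $$

From the last proposition,

$$ n^{1-\varepsilon} \| \theta - P_{n^{\varepsilon}} \theta \|^2 \le M \, n^{1 - \varepsilon - 2\varepsilon(m+\alpha)} . $$

The RHS above tends to zero if $1 - \varepsilon - 2\varepsilon(m+\alpha) < 0$, i.e. if $\varepsilon > \frac{1}{1 + 2(m+\alpha)}$, which is the required conclusion.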
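The exponent bookkeeping in this corollary, and the way it feeds the condition $1 - 2\beta - 2\varepsilon - 2\varepsilon q > 0$ of Theorem 3.1, can be traced with exact fractions. The helper names below are hypothetical, and `beta_sup_l2` simply evaluates the limiting value $\frac{1}{2} - \varepsilon$ at the threshold $\varepsilon$ with $q \to 0$.

```python
from fractions import Fraction


def epsilon_threshold(s):
    """Lower limit for epsilon in Corollary 3.5, with s = m + alpha."""
    return 1 / (1 + 2 * Fraction(s))


def decay_exponent(eps, s):
    """Exponent of n in the bound M * n^(1 - eps - 2 eps s)."""
    return 1 - Fraction(eps) - 2 * Fraction(eps) * Fraction(s)


def beta_sup_l2(s):
    """Supremum of beta with 1 - 2 beta - 2 eps - 2 eps q > 0 as q -> 0."""
    return Fraction(1, 2) - epsilon_threshold(s)


s = Fraction(3, 2)                      # e.g. m = 1, alpha = 1/2
print(epsilon_threshold(s))             # threshold for epsilon
print(decay_exponent(Fraction(3, 10), s))
print(beta_sup_l2(s))
```

For $m + \alpha = \frac{3}{2}$ the threshold is $\varepsilon > \frac{1}{4}$, and the limiting rate exponent agrees with $\frac{(m+\alpha) - \frac{1}{2}}{2(m+\alpha)+1}$.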

The next proposition shows that the Fourier coefficients of functions in $\Theta_{m,\alpha}$ enjoy nice convergence properties (see pp 240, Zygmund (1949) vol II and Theorem 5, pp 71, vol I).

Proposition 3.2

i) If $\theta \in \Theta_{m,\alpha}$, $m + \alpha > \frac{1}{2}$, then

$$ \sup_{0 \le t \le 1} | \theta(t) | \le \sqrt{2} \sum_{j} | \theta_j | < \infty $$

where the $\theta_j$'s are the Fourier coefficients of $\theta$.

ii) $\theta \in \Theta_{m,\alpha}$ implies that

$$ | \theta_j | = O ( j^{-m-\alpha} ) . $$

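Proof of Theorem 3.3 Follows from Theorem 3.2 and Corollary 3.5 (because for every $\beta < \frac{(m+\alpha) - \frac{1}{2}}{2(m+\alpha)+1}$ we can find an $\varepsilon$ and $q > 0$ such that $\Theta_{m,\alpha} \subset \Theta_{\varepsilon}$ and $1 - 2\beta - 2\varepsilon - 2\varepsilon q > 0$).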

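Proof of Theorem 3.4: This result follows easily from Theorem 3.2, Corollary 3.5 and Proposition 3.2. To see this first note that

$$ \sup_{0 \le t \le 1} | \hat\theta^{n}_{n^{\varepsilon}}(t) - \theta_t | \le \sqrt{2} \, \Big\{ \sum_{j=1}^{n^{\varepsilon}} | \hat\theta^{n}_{j} - \theta_j | + \sum_{n^{\varepsilon}+1}^{\infty} | \theta_j | \Big\} . $$

Hence the result is proved if we show that

a) $n^{\beta} \sum_{j=1}^{n^{\varepsilon}} | \hat\theta^{n}_{j} - \theta_j | \to 0$ and

b) $n^{\beta} \sum_{n^{\varepsilon}+1}^{\infty} | \theta_j | \to 0$ as $n \to \infty$.

First let us show (a). Using the Cauchy-Schwarz inequality,

$$ \sum_{j=1}^{n^{\varepsilon}} | \hat\theta^{n}_{j} - \theta_j | \le n^{\varepsilon/2} \Big( \sum_{j=1}^{n^{\varepsilon}} | \hat\theta^{n}_{j} - \theta_j |^2 \Big)^{1/2} . $$

Therefore (a) is proved if we show that

$$ n^{\beta + \frac{\varepsilon}{2}} \, \| \hat\theta^{n}_{n^{\varepsilon}} - \theta_{n^{\varepsilon}} \| \to 0 \quad \text{a.s. as } n \to \infty . $$

But this follows from Theorem 3.2 because for $\beta < \frac{(m+\alpha) - 1}{2(m+\alpha)+1}$ we can find an $\varepsilon$ and $q$ such that $\Theta_{m,\alpha} \subset \Theta_{\varepsilon}$ and $1 - 2(\beta + \frac{\varepsilon}{2}) - 2\varepsilon - 2\varepsilon q > 0$. Now consider (b). From Proposition 3.2 (ii), there exists a $C$ such that for large $n$,

$$ n^{\beta} \sum_{n^{\varepsilon}+1}^{\infty} | \theta_j | \le C \, n^{\beta} \sum_{n^{\varepsilon}+1}^{\infty} j^{-m-\alpha} . $$

Hence

$$ n^{\beta} \sum_{n^{\varepsilon}+1}^{\infty} | \theta_j | \le C \, n^{\beta} ( n^{\varepsilon} )^{1-m-\alpha} = C \, n^{\beta + \varepsilon(1-m-\alpha)} . $$

The RHS above tends to zero if $\beta + \varepsilon(1-m-\alpha) < 0$. Again, for $\beta < \frac{(m+\alpha) - 1}{2(m+\alpha)+1}$ one can find an $\varepsilon$ such that $\Theta_{m,\alpha} \subset \Theta_{\varepsilon}$ and $\beta + \varepsilon(1-m-\alpha) < 0$; and thus (b) is proved.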

IV Intensity estimation of Poisson process:

Let $(N_t)$, $0 \le t \le 1$, be a Poisson process with intensity function $\theta = (\theta_t)$, $0 \le t \le 1$; i.e., $M_t = N_t - \int_0^t \theta_s \, ds$ is a square integrable martingale with square variation process

$$ \langle M \rangle_t = \int_0^t \theta_s \, ds . \tag{4.1} $$

The deterministic function $\theta$ is assumed to be positive, bounded and bounded away from zero. The aim is to estimate $\theta$ using a sample of $n$ independent trajectories of $(N_t)$, $0 \le t \le 1$. This problem is treated as one of the examples in a paper by A.F. Karr (1987). There he proposes a sieve estimator (histogram sieve estimator) which is shown to be strongly consistent in the $L^1$ norm. The rate of convergence of this estimator is not discussed. As in the case of Section III, for this example, the procedure suggested in Section II yields sieve estimators possessing good consistency properties. A detailed discussion of the rate of convergence of these estimators is given after developing the necessary notation. All the proofs are given at the end.

Let $D = D[0,1]$ be the space of right continuous, left limit functions on $[0,1]$, endowed with the Skorokhod topology; $\mathcal{D}$ will denote the $\sigma$-field of Borel sets in $D$.

Let

$$ \Theta = \{ f \in L^2([0,1], dt) : f \ge 0, \ \sup_{0 \le t \le 1} f(t) < \infty \ \text{and} \ \inf_{0 \le t \le 1} f(t) > 0 \} . \tag{4.2} $$

$\Theta$ will serve as the parameter space for our problem.

Suppose, for $\theta \in \Theta$, $P_{\theta}$ is the measure induced on $(D, \mathcal{D})$ by the paths of a Poisson process with intensity $\theta$; then if $\theta, \mu \in \Theta$, it is well-known (see Karr (1987), pp 475) that $P_{\theta}$ and $P_{\mu}$ are mutually absolutely continuous and, for $\omega \in D$,

$$ \frac{dP_{\theta}}{dP_{\mu}} (\omega) = \exp \Big( \int_0^1 ( \mu_t - \theta_t ) \, dt + \int_0^1 \ln \Big( \frac{\theta_t}{\mu_t} \Big) \, d\omega_t \Big) . \tag{4.3} $$

In what follows, $(D^n, \mathcal{D}^n)$ will denote the $n$-fold product of $(D, \mathcal{D})$, $\omega^n$ a typical point in $D^n$ and $P^n_{\theta}$ the $n$-fold product of $P_{\theta}$.

Following the method from Sec II, a sieve estimator $\hat\theta^{n}_{k}$ is proposed to estimate $\theta$. The loglikelihood function, from (4.3), can be written as,

$$ l(\theta, \omega^n) = \ln \frac{dP^n_{\theta}}{dP^n_{1}} (\omega^n) = n \int_0^1 ( 1 - \theta_t ) \, dt + \sum_{i=1}^{n} \int_0^1 \ln (\theta_t) \, d\omega^i_t . \tag{4.4} $$

($\mu$ is taken to be identically equal to 1 and the $\omega^i$ are the components of $\omega^n$; under $P^n_{\theta}$ they are i.i.d. Poisson processes with intensity $\theta$.)

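Two different types of sieves $S_k$ are used.

I) For $k \ge 1$ and $q > 0$ let

$$ S_k = \Big\{ \theta \in \Theta : \theta = \sum_{j=1}^{k} \theta_j f_j , \ k^{-q} \le \theta_t \le k^{q} \ \text{for all } t \Big\} \tag{4.5} $$

where for $1 \le j \le k$, $f_j(t) = 1_{[\frac{j-1}{k}, \frac{j}{k})}(t)$.

(This sieve is sometimes referred to as the Histogram sieve.)

II)

$$ S_k = \Big\{ \theta \in \Theta : \theta = \sum_{j=1}^{k} \theta_j f_j , \ k^{-q} \le \theta_t \le k^{q} \ \text{for all } t \Big\} $$

where for $j \ge 1$, the $f_j$'s are defined as follows:

Take $f_1 = 1$ and for $j \ge 2$

$$ f_j(t) = \sqrt{2} \cos ( \pi j t ) \ \text{for even } j\text{'s and} \quad f_j(t) = \sqrt{2} \sin [ \pi (j+1) t ] \ \text{for odd } j\text{'s.} \tag{4.6} $$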
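For the histogram sieve (4.5) the maximization of (4.4) can be done bin by bin: if $N_j$ is the total number of jumps of the $n$ trajectories in the $j$-th bin, the loglikelihood restricted to $S_k$ is $n - \frac{n}{k} \sum_j \theta_j + \sum_j N_j \ln \theta_j$, which is concave in each $\theta_j$ separately. The sketch below is an illustration under that observation and is not taken from the text; the function names and the input format (a pooled list of jump times) are assumptions.

```python
import math


def histogram_sieve_mle(event_times, n_paths, k, q):
    """Maximize (4.4) over the histogram sieve S_k of (4.5).

    event_times: pooled jump times in [0, 1] of all n_paths trajectories.
    Each coordinate has unconstrained maximizer k * N_j / n_paths; by
    concavity the constrained maximizer is its clipping to [k^-q, k^q].
    """
    counts = [0] * k
    for t in event_times:
        counts[min(int(t * k), k - 1)] += 1  # t = 1.0 goes to the last bin
    lo, hi = k ** (-q), k ** q
    return [min(max(k * c / n_paths, lo), hi) for c in counts]


def log_likelihood(theta, event_times, n_paths):
    """Loglikelihood (4.4) of a piecewise-constant intensity theta."""
    k = len(theta)
    value = n_paths * (1.0 - sum(theta) / k)
    for t in event_times:
        value += math.log(theta[min(int(t * k), k - 1)])
    return value


times = [0.1, 0.2, 0.3, 0.6]            # hypothetical pooled jump times
estimate = histogram_sieve_mle(times, n_paths=2, k=2, q=1)
print(estimate, log_likelihood(estimate, times, 2))
```

In this toy example the first bin's unconstrained value 3 exceeds the sieve bound $k^q = 2$, so the estimate is clipped there; this is exactly the role the constraint $k^{-q} \le \theta_t \le k^{q}$ plays in (4.5).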

The results in this section are almost identical to the results in the last section.

To obtain consistency results, using the method from Section II, we need some additional "smoothness" assumptions on the Fourier coefficients of $\theta$. Indeed, in order for the main consistency result of this section, Theorem 4.1, to be applicable, we require that $\theta \in W_{\varepsilon,q}$ for some $\varepsilon > 0$ and $q > 0$ such that $1 - 2\varepsilon q - 2\varepsilon > 0$, the class $W_{\varepsilon,q}$ being defined as follows:

$$ W_{\varepsilon,q} = \{ \theta \in \Theta : n^{1 + 2\varepsilon q - \varepsilon} \, \| P_{n^{\varepsilon}} \theta - \theta \| \to 0 \} \tag{4.7} $$

where $P_m \theta$ is the projection of $\theta$ on the span of $\{ f_j , 1 \le j \le m \}$, the $f_j$'s are as in (4.5) or (4.6) and $\|\cdot\|$ denotes the $L^2$-norm.

The class $W_{\varepsilon,q}$ corresponds to the class $\Theta_{\varepsilon}$ of the last section and in this case also, if the $f_j$'s used in defining $W_{\varepsilon,q}$ are the trigonometric functions given in (4.6), it is easy to show that,

$$ \Theta_{m,\alpha} \subset W_{\varepsilon,q} \quad \text{for } \varepsilon ( 1 + (m+\alpha) - 2q ) > 1 . \tag{4.8} $$

Recall that $\Theta_{m,\alpha}$ is the class of $m$-times continuously differentiable functions with $m$-th derivative Lipschitz continuous of order $\alpha$.

We state the results of this section without any further discussion, the relations among different results here being exactly identical to the relations among the corresponding results in the last section.

For the next two results $\hat\theta^{n}_{n^{\varepsilon}}$ can correspond to either of the two sieves given in (4.5) and (4.6).

Theorem 4.1 Assume that the true parameter $\theta$ belongs to $W_{\varepsilon,q}$ for some $\varepsilon > 0$ and $q > 0$ where $1 - 2\varepsilon - 2\varepsilon q > 0$. Choose $\beta > 0$ such that

$$ b = 1 - 2\beta - \varepsilon - 2\varepsilon q > \varepsilon > 0 . $$

Then, for all $n$ sufficiently large and $h > 0$,

$$ P^n_{\theta} \{ n^{\beta} \| \hat\theta^{n}_{n^{\varepsilon}} - \theta_{n^{\varepsilon}} \| > h \} \le \exp ( - C \, n^{b} h^2 ) $$

where $C > 0$ depends on $\theta$ alone.

Theorem 4.2 Assume that $\theta \in W_{\varepsilon,q}$ for some $\varepsilon$ and $q > 0$ where $1 - 2\varepsilon - 2\varepsilon q > 0$. Choose $\beta > 0$ such that

$$ 1 - 2\beta - \varepsilon - 2\varepsilon q > \varepsilon > 0 . $$

Then,

$$ n^{\beta} \| \hat\theta^{n}_{n^{\varepsilon}} - \theta \| \to 0 \quad \text{a.s. as } n \to \infty . $$

For the following results it is assumed that the CONS $\{ f_j \}$ used in defining the sieve $S_k$ is the one given in (4.6) (i.e. consisting of trigonometric functions).


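Theorem 4.3 Let $\theta \in \Theta_{m,\alpha}$, $m + \alpha > 1$. Then, for every $\beta < \frac{(m+\alpha) - 1}{2(m+\alpha)+2}$, there exist $\varepsilon > 0$ and $q > 0$ such that

$$ n^{\beta} \| \hat\theta^{n}_{n^{\varepsilon}} - \theta \| \to 0 \quad \text{a.s. as } n \to \infty . $$

Theorem 4.4 Let $\theta \in \Theta_{m,\alpha}$, $m + \alpha > 2$. Then, for every $\beta < \frac{(m+\alpha) - 2}{2(m+\alpha)+2}$ there exist $\varepsilon > 0$ and $q > 0$ such that

$$ n^{\beta} \sup_{0 \le t \le 1} | \hat\theta^{n}_{n^{\varepsilon}}(t) - \theta(t) | \to 0 \quad \text{a.s. as } n \to \infty . $$

Corollary 4.1 Let $\theta \in \Theta_{m,\alpha}$ for every $m$; then, for every $\beta < \frac{1}{2}$ there exist $\varepsilon > 0$ and $q > 0$ such that

$$ n^{\beta} \sup_{0 \le t \le 1} | \hat\theta^{n}_{n^{\varepsilon}}(t) - \theta(t) | \to 0 \quad \text{a.s., as } n \to \infty . $$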
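The rate exponents in Theorems 4.3 and 4.4 can be recovered from (4.8) and the condition $1 - 2\beta - \varepsilon - 2\varepsilon q > \varepsilon$ by letting $q \to 0$ and $\varepsilon$ decrease to its threshold. The sketch below does this with exact fractions; it assumes that the uniform-norm result costs an extra factor $n^{\varepsilon/2}$, as in the proof of Theorem 3.4, and the function names are illustrative.

```python
from fractions import Fraction


def epsilon_threshold_w(s, q=0):
    """Lower limit for epsilon in (4.8): eps * (1 + s - 2q) > 1, s = m + alpha."""
    return 1 / (1 + Fraction(s) - 2 * Fraction(q))


def beta_sup_l2(s):
    """Limiting exponent for the L2 result: 1/2 - eps at the threshold, q -> 0."""
    return Fraction(1, 2) - epsilon_threshold_w(s)


def beta_sup_uniform(s):
    """Limiting exponent for the uniform result, assuming an extra eps/2 loss."""
    return Fraction(1, 2) - Fraction(3, 2) * epsilon_threshold_w(s)


for s in (3, 5, 50):
    print(s, beta_sup_l2(s), beta_sup_uniform(s))
```

Both expressions simplify to the fractions stated in the theorems, $\frac{(m+\alpha)-1}{2(m+\alpha)+2}$ and $\frac{(m+\alpha)-2}{2(m+\alpha)+2}$, and both tend to $\frac{1}{2}$ as the smoothness grows, in line with Corollary 4.1.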

Proof of Theorem 4.1: This theorem will be proved using Corollary 2.1. The conditions of this corollary are verified using several lemmas. The first lemma, Lemma 4.1, is a simple result from point process theory. It can be easily proved first by verifying it for simple functions and then extending it to more general functions using the standard techniques.

Lemma 4.1 Suppose $(N_t)$, $0 \le t \le 1$, is a Poisson process with bounded intensity $(\theta_t)$; then, for all $f$, bounded and deterministic,

$$ E \exp \Big( \int_0^1 f_s \, dN_s \Big) = \exp \Big( - \int_0^1 ( 1 - \exp (f_s) ) \, \theta_s \, ds \Big) . $$
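In the simplest case of constant $f$ and constant intensity $\theta$, the integral $\int_0^1 f_s \, dN_s$ equals $f N_1$ with $N_1$ Poisson distributed with mean $\theta$, so the identity of Lemma 4.1 can be checked by summing the Poisson series directly. The following sketch does this; the function names and the numerical values are assumptions chosen for illustration.

```python
import math


def lhs_series(f, theta, n_terms=200):
    """E exp(f * N) for N ~ Poisson(theta), by summing the probability series."""
    total, prob = 0.0, math.exp(-theta)  # prob = P(N = 0)
    for n in range(n_terms):
        total += math.exp(f * n) * prob
        prob *= theta / (n + 1)          # P(N = n + 1) from P(N = n)
    return total


def rhs_formula(f, theta):
    """Right-hand side of Lemma 4.1 for constant f and theta on [0, 1]."""
    return math.exp(-(1.0 - math.exp(f)) * theta)


print(lhs_series(0.5, 3.0), rhs_formula(0.5, 3.0))
# The choice f = (1/2) ln(r) is the one used later with r a ratio of intensities.
print(lhs_series(0.5 * math.log(1.5), 2.0), rhs_formula(0.5 * math.log(1.5), 2.0))
```

The two columns agree to floating-point accuracy; the general statement for bounded deterministic $f$ follows by the approximation argument mentioned before the lemma.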

Let us recall the definition of the sieve $S_k$. For $k \ge 1$ and $q > 0$,

$$ S_k = \Big\{ \theta \in \Theta : \theta = \sum_{j=1}^{k} \theta_j f_j , \ k^{-q} \le \theta(t) \le k^{q} , \ t \in [0,1] \Big\} $$

where $\{ f_j \}$, $1 \le j \le k$, is a set of independent vectors given in either (4.5) or in (4.6).

For $\theta \in \Theta$, let $\theta_k$ denote its projection in $S_k$. Recall from (4.4) that the loglikelihood function is

$$ l(\theta, \omega^n) = n \int_0^1 ( 1 - \theta_t ) \, dt + \sum_{i=1}^{n} \int_0^1 \ln (\theta_t) \, d\omega^i_t \tag{4.9} $$

where under $P^n_{\theta}$, $(\omega^i_t)$, $0 \le t \le 1$, are i.i.d. Poisson processes with common intensity $(\theta_t)$, $0 \le t \le 1$.

As usual, let us define random functions $Z_{kn}(u)$ with domains $U_{kn}$ as follows:

For $n \ge 1$ and $k \ge 1$, let

$$ U_{kn} = n^{\beta} ( S_k - \theta_k ) \tag{4.10} $$

and for $u \in U_{kn}$,

$$ Z_{kn}(u) = l(\theta_k + n^{-\beta} u) - l(\theta_k) . \tag{4.11} $$

Let $g_{kn} : [0, \infty) \to R^1$ be given by,

$$ g_{kn}(x) = C \, n^{1-2\beta} k^{-2q} x^2 - n \, k^{2q} \, \| \theta - \theta_k \| \tag{4.12} $$

where $C > 0$ depends on $\theta$ alone.

Lemma 4.2 For $u \in U_{kn}$,

$$ E \exp \Big( \frac{1}{2} Z_{kn}(u) \Big) \le \exp ( - g_{kn}( \|u\| ) ) . $$

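Proof: In view of (4.9) and (4.11),

$$ Z_{kn}(u) = - n^{1-\beta} \int_0^1 u_t \, dt + \sum_{i=1}^{n} \int_0^1 \ln \Big( \frac{\theta_k + n^{-\beta} u}{\theta_k} \Big)_t \, d\omega^i_t . $$

Recall that, under $P^n_{\theta}$, $(\omega^i_t)$, $0 \le t \le 1$, are i.i.d. Poisson processes with common intensity $(\theta_t)$, $0 \le t \le 1$. Hence,

$$ E \exp \Big( \frac{1}{2} Z_{kn}(u) \Big) = \exp \Big( - \frac{n^{1-\beta}}{2} \int_0^1 u_t \, dt \Big) \cdot \prod_{i=1}^{n} E \exp \Big( \frac{1}{2} \int_0^1 \ln \Big( \frac{\theta_k + n^{-\beta} u}{\theta_k} \Big)_t \, d\omega^i_t \Big) . $$

Setting $f_s = \frac{1}{2} \ln \big( \frac{\theta_k + n^{-\beta} u}{\theta_k} \big)_s$ and then using Lemma 4.1 in the second factor in the above product we have

$$ E \exp \Big( \frac{1}{2} Z_{kn}(u) \Big) = \exp \Big( - \frac{n^{1-\beta}}{2} \int_0^1 u_t \, dt \Big) \cdot \exp \Big( - n \int_0^1 \Big[ 1 - \Big( \frac{\theta_k + n^{-\beta} u}{\theta_k} \Big)_t^{1/2} \Big] \theta_t \, dt \Big) . \tag{4.13} $$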
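Let $F(x) = 1 - \big( \frac{a + x}{a} \big)^{1/2}$ where $a > 0$ and $x \in (-a, a)$. Then by Taylor's formula we have

$$ F(x) = - \frac{x}{2a} + \frac{x^2}{8 \sqrt{a}} \, ( a + x^* )^{-3/2} $$

for some $x^*$, $x^* \in (0, x)$ (or $(x, 0)$ as the case may be).

Taking $a = \theta_k(t)$ and $x = n^{-\beta} u_t$, the RHS of (4.13) can be written as

$$ \exp \Big( - \frac{n^{1-\beta}}{2} \int_0^1 u_t \, dt + \frac{n^{1-\beta}}{2} \int_0^1 u_t \, \frac{\theta_t}{\theta_k(t)} \, dt - \frac{n^{1-2\beta}}{8} \int_0^1 \frac{u_t^2}{\sqrt{\theta_k(t)}} \, ( \theta_k + n^{-\beta} u^* )_t^{-3/2} \, \theta_t \, dt \Big) . $$

Writing $\frac{\theta_t}{\theta_k(t)} = 1 + \frac{(\theta - \theta_k)_t}{\theta_k(t)}$ in the second term, the above quantity takes the form

$$ \exp \Big( \frac{n^{1-\beta}}{2} \int_0^1 u_t \, \frac{(\theta - \theta_k)_t}{\theta_k(t)} \, dt - \frac{n^{1-2\beta}}{8} \int_0^1 \frac{u_t^2}{\sqrt{\theta_k(t)}} \, ( \theta_k + n^{-\beta} u^* )_t^{-3/2} \, \theta_t \, dt \Big) . \tag{4.14} $$

Note that, since it is a member of $S_k$, $\theta_k(t) \ge k^{-q} > 0$ for every $t$. Let us consider the first term in the exponent above. From the Cauchy-Schwarz inequality,

$$ \frac{n^{1-\beta}}{2} \int_0^1 u_t \, \frac{(\theta - \theta_k)_t}{\theta_k(t)} \, dt \le \frac{n^{1-\beta}}{2} \, \Big\| \frac{u}{\theta_k} \Big\| \cdot \| \theta - \theta_k \| . $$

However, note that $\big| \frac{u_t}{\theta_k(t)} \big| \le 2 n^{\beta} k^{2q}$, because (i) $\theta_k \in S_k$ implies that $\theta_k(t)^{-1} \le k^{q}$ for all $t$ and (ii) $u \in U_{kn}$ implies that $| u_t | \le 2 n^{\beta} k^{q}$. Hence $\| \frac{u}{\theta_k} \| \le 2 n^{\beta} k^{2q}$ and therefore,

$$ \frac{n^{1-\beta}}{2} \int_0^1 u_t \, \frac{(\theta - \theta_k)_t}{\theta_k(t)} \, dt \le \frac{n^{1-\beta}}{2} \cdot 2 n^{\beta} k^{2q} \cdot \| \theta - \theta_k \| . \tag{4.15} $$

Now consider the second term in the exponent of (4.14). First, since $\theta \in \Theta$ there exist $m_{\theta}$ and $M_{\theta}$ such that $0 < m_{\theta} \le \theta_t \le M_{\theta} < \infty$. Furthermore,

$$ \frac{ ( \theta_k + n^{-\beta} u^* )_t^{-3/2} }{ \sqrt{\theta_k(t)} } \ge k^{-2q} \quad \text{for all } t . $$

This is so because $u^*_t \in (0, u_t)$ (i.e. $| u^*_t | \le | u_t |$) implies that $( \theta_k + n^{-\beta} u^* )_t$ lies between $( \theta_k )_t$ and $( \theta_k + n^{-\beta} u )_t$, where $\theta_k$ and $\theta_k + n^{-\beta} u$ are members of $S_k$, and therefore,

$$ k^{-q} \le ( \theta_k + n^{-\beta} u^* )_t \le k^{q} $$

and

$$ k^{-q} \le ( \theta_k )_t \le k^{q} \quad \text{for all } t . $$

Hence,

$$ \frac{ ( \theta_k + n^{-\beta} u^* )_t^{-3/2} }{ \sqrt{\theta_k(t)} } = \{ ( \theta_k )_t \, ( \theta_k + n^{-\beta} u^* )_t^{3} \}^{-1/2} \ge k^{-2q} . $$

Therefore,

$$ - \frac{n^{1-2\beta}}{8} \int_0^1 \frac{u_t^2}{\sqrt{\theta_k(t)}} \, ( \theta_k + n^{-\beta} u^* )_t^{-3/2} \, \theta_t \, dt \le - \frac{n^{1-2\beta}}{8} \, m_{\theta} \, k^{-2q} \, \| u \|^2 . \tag{4.16} $$

From (4.15) and (4.16) we then have

$$ \exp \Big( \frac{n^{1-\beta}}{2} \int_0^1 u_t \, \frac{(\theta - \theta_k)_t}{\theta_k(t)} \, dt - \frac{n^{1-2\beta}}{8} \int_0^1 \frac{u_t^2}{\sqrt{\theta_k(t)}} \, ( \theta_k + n^{-\beta} u^* )_t^{-3/2} \, \theta_t \, dt \Big) $$

$$ \le \exp \Big( \frac{n^{1-\beta}}{2} \cdot 2 n^{\beta} k^{2q} \, \| \theta - \theta_k \| - \frac{n^{1-2\beta}}{8} \, m_{\theta} \, k^{-2q} \, \| u \|^2 \Big) . \tag{4.17} $$

Therefore, in view of (4.14) and (4.17), we have shown that, with $C = m_{\theta} / 8$,

$$ E \exp \Big( \frac{1}{2} Z_{kn}(u) \Big) \le \exp \big( - C \, n^{1-2\beta} k^{-2q} \| u \|^2 + n \, k^{2q} \| \theta - \theta_k \| \big) = \exp ( - g_{kn}( \|u\| ) ) ; $$

which completes the proof.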
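The second-order Taylor expansion of $F(x) = 1 - ((a + x)/a)^{1/2}$ used in this proof can be sanity-checked numerically. Since $F''(x) = \frac{1}{4 \sqrt{a}} (a + x)^{-3/2}$, the Lagrange remainder is $\frac{x^2}{8 \sqrt{a}} (a + x^*)^{-3/2}$, so the ratio $(F(x) + \frac{x}{2a}) / x^2$ must lie between the values of $\frac{1}{8 \sqrt{a}} (a + y)^{-3/2}$ at $y = 0$ and $y = x$. The function names and test points below are assumptions for illustration.

```python
import math


def F(x, a):
    """F(x) = 1 - sqrt((a + x) / a), defined for a > 0 and -a < x < a."""
    return 1.0 - math.sqrt((a + x) / a)


def remainder_coefficient(x, a):
    """(F(x) + x / (2a)) / x^2, i.e. the second-order Taylor remainder over x^2."""
    return (F(x, a) + x / (2.0 * a)) / (x * x)


def lagrange_coefficient(y, a):
    """F''(y) / 2 = 1 / (8 sqrt(a) (a + y)^(3/2))."""
    return 1.0 / (8.0 * math.sqrt(a) * (a + y) ** 1.5)


a = 2.0
for x in (1.0, -1.0, 0.3):
    low, high = sorted((lagrange_coefficient(0.0, a), lagrange_coefficient(x, a)))
    assert low <= remainder_coefficient(x, a) <= high
```

For small $x$ the ratio approaches $\frac{1}{8 a^2}$, the value of $F''(0)/2$.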

Lemma 4.3 For $u, v \in U_{kn}$,

$$ E \, | Z_{kn}(u) - Z_{kn}(v) |^{2k} \le B_{kn} \, \| u - v \|^{2k} $$

where

$$ B_{kn} = 2^{3k+1} \, n^{2k(1-\beta)+1} \, k^{2k(1+q)} \, M_{\theta}^{2k} . $$

The proof of this lemma is very similar to the proof of Lemma 3.3 and we omit it. Lemmas 4.2 & 4.3 verify all the conditions of Corollary 2.1 (the verification being quite routine) and thus Theorem 4.1 is proved. The deduction of Theorems 4.2 - 4.4 from Theorem 4.1 is also straightforward.